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Adenovirus Vectors, Packaging Cell Lines, 
Compositions, and Methods for Preparation and Use 



This application is a continuation-in-part of U.S. Application 
09/423,783 filed November 12, 1999 and claims the benefit of the filing date of 
U.S. Provisional Application 60/1 15,920 filed January 14, 1999. The contents 
of each application are incorporated herein by reference. 

Statement as to Rights to Inventions Made Under 
Federally Sponsored Research and Development 

This invention was made with U. S . government support under NIH Grant 
No. HL 54352. The government has certain rights in the invention. 

Technical Field of the Invention 

The present invention relates to gene therapy, especially to adenovirus- 
based gene therapy. In particular, novel packaging cell lines are disclosed, for use 
in facilitating the development of high-capacity and targeted vectors. High- 
capacity adenovirus vectors are also disclosed herein, as are related compositions, 
kits, and methods of preparation and use of the disclosed vectors, cell lines and 
kits. 

Background of the Invention 

Enhanced transfer of DNA conjugates into cells has been achieved with 
adenovirus, a human DNA virus which readily infects epithelial cells (Horwitz, 
"Adenoviridae and Their Replication", in Virology, Fields and Knipe, eds., Raven 
Press, NY (1990) pp. 1679-1740). 

Although adenovirus-mediated gene therapy represents an improved 
method of DNA transfer into cells, a potential limitation of this approach is that 
adenovirus replication results in disruption of the host cell. In addition, 
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adenovirus also possesses oncogenic properties including the ability of one of its 
proteins to bind to tumor suppressor gene products. The use of so-called 
replication defective strains of adenovirus (which typically possess E1A and/or 
E1B deletions that render the virus unable to replicate in host cells) is in principle 
5 more suitable for in vivo therapy; however, the potential of co-infection of 

epithelial cells with wild-type strains of virus resulting in transactivation of the 
recombinant virus may represent a significant safety concern for in vivo 
applications. 

Another undesirable aspect of using intact or replication-competent 

10 adenovirus as a gene transfer means is that it is an oncogenic virus whose gene 

products are known to interfere with the function of host cell tumor suppressor 
proteins as well as immune recognition molecules, such as the major 
histocompatibility complex (MHC). In addition, pre-existing circulating antibodies 
to adenovirus may significantly reduce the efficiency of in vivo gene delivery. 

1 5 Lastly, only a foreign gene of 6 kilobases (kb) or less can be incorporated into the 

intact adenovirus genome for gene transfer experiments, whereas DNA segments 
of greater than 12 kb can be transferred using the methods of this invention. 

In order to make Ad vectors more replication-incompetent, some 
investigators have attempted to construct recombinant Ad-derived vectors which 

20 have nearly all of their genome deleted, except for portions known to be required 

for packaging of virus particles. For example, helper-dependent vectors lacking 
all viral ORFs but including essential cis elements (the inverted terminal repeats — 
ITRs - and the contiguous packaging sequence) have been constructed, but the 
virions package less efficiently than the helper and package as multimers part of 

25 the time, which suggests that the virus may "want" to package a fuller DNA 

complement (see, e.g., Fisher, et al, Virology 217: 1 1-22, 1996). Mitani et al 
(Proc. Natl Acad. ScL USA 92: 3854-3858, 1995) also describe a helper- 
dependent Ad vector that was apparently not completely replication-defective. 
Amalfitano, et al {Proc. Natl Acad. Sci. USA 93: 3352-3356, 1996) 

30 describe the construction of Ad packaging cell lines that support the growth of 
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E 1 - and polymerase-deleted Ad vectors, in an effort to block the replication of Ad 
vectors in vivo. Similarly, Armentano, et al (Hum. Gene Ther. 6: 1343-53, 
1995) describes Ad vectors with most — but not all — of the E4 sequence deleted 
therefrom. However, since such a small amount of genetic material is deleted 
5 from the vectors, their ability to transport therapeutic sequences is rather limited. 

Published International App. No. WO96/14061 describes efforts to construct 
packaging cell lines containing nucleotide sequences encoding El and ORF6 of 
E4. 

In addition to being able to incorporate large amounts of DNA into a 
10 vector, the ability to target the vectors to specific cell types will result in more 

efficient administration of desired therapeutics. Such targeting of adenovirus in a 
relatively simple system is one of the advantages of the current invention. 

Thus, there is a need in the art to obtain Adenovirus vectors 1) capable of 
incorporating large segments of foreign DNA and capable of being targeted to 
15 specific cells, as well as to obtain cell lines which can package such adenovirus- 

gene deficient vector or targeted vectors. These needs, as well as others, are met 
by the invention. 

Brief Summary of the Invention 

This invention utilizes recombinant adenovirus constructs which duplicate 
20 the cell receptor binding and DNA delivery properties of intact adenovirus virions 

and thus represents an improved method for gene therapy and cell targeting as 
well as for antisense-based antiviral therapy. 

In contrast to the disadvantages of using intact adenovirus, modified 
adenovirus vectors requiring a helper plasmid or virus, or so-called replication- 
25 deficient adenovirus in the art, the use of recombinant adenovirus-derived vectors 

according to one aspect of the present invention provides certain advantages for 
gene delivery. First, the Ad-derived vectors of the present invention possess all 
of the functional properties required for gene therapy including binding to 
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epithelial cell receptors and penetration of endocytic vesicles. Therapeutic viral 
vectors of the present invention may also be engineered to target the receptors of 
and achieve penetration of non-epithelial cells; means of engineering viral vectors 
to accomplish these ends are described in detail herein below. 
5 Second, the vectors of the present invention have deletions of substantial 

portions of the Ad genome, which not only limits the ability of the Ad-derived 
vectors to "spread" to other host cells or tissues, but allows significant amounts 
of "foreign" (or non-native) nucleic acids to be incorporated into the viral genome 
without interfering with the reproduction and packaging of the viral genome. 
_ 10 Therefore, the vectors of the present invention are ideal for use in a wide variety 

: of therapeutic applications. 

Third, while the vectors disclosed herein are safe for use as therapeutic 
) agents in the treatment of a variety of human afflictions, some of these vectors 

; do not require the presence of any "helpers" for propagation and packaging, 

15 largely because of the novel cell lines in which they are reproduced. Such cell 

lines ~ referred to herein as packaging cell lines — comprise yet another aspect of 
the invention. 

; To reduce the frequency of contamination with wild-type adenovirus, it 

is desirable to improve either the viral vector or the cell line to reduce the 

20 probability of recombination. For example, an adenovirus from a group with less 

homology to the group C viruses may be used to engineer recombinant viruses 
with little propensity for recombination with the Ad5 sequence contained in the 
packaging lines. The invention describes the preparation of packaging cells lines 
which stably expresses adenovirus proteins or polypeptides. These cell lines are 

25 useful for complementing viral vectors bearing deletions of regulatory and/or 

structural genes, irrespective of the serotype from which such a vector was 
derived. 

It is also contemplated that the constructs and methods of the present 
invention will support the design and engineering of chimeric viral vectors which 
30 express amino acid residue sequences derived from two or more Ad serotypes. 



Thus, unlike methods and constructs available prior to the advent of the present 
disclosure, this invention allows the greatest possible flexibility in the design and 
preparation of useful viral vectors and cell lines which support their construction 
and propagation — all with a decreased risk of recombining with wild-type Ad to 
produce potentially-harmful recombinants. 

In part, the present invention discloses a simpler, alternative means of 
reducing the recombination between viral and cellular sequences than those 
discussed in the art. One such means is to increase the size of the deletion in the 
recombinant virus and thereby reduce the extent of shared sequences between that 
virus and any Ad genes present in a packaging cell line e.g., the Ad5 genes in 293 
cells, or the various Ad genes in the novel cell lines of the present invention. 

Deletions of all or portions of structural genes of the adenovirus have been 
considered undesirable because of the anticipated deleterious effects such deletions 
would have on viral reproduction and packaging. Indeed, the use of "helper" 
viruses or plasmids has often been recommended when using Ad-derived vectors 
containing large deletions in structural protein sequences precisely for this reason. 

Contrary to what has been suggested in the art, however, this invention 
discloses the preparation, propagation and use of recombinant Ad-derived vectors 
having deletions of all or part of various gene sequences encoding Ad structural 
proteins, both as away of reducing the risk of wild-type adenovirus contamination 
in virus preparations, as a way of allowing foreign DNA to be packaged in such 
vectors for a variety of diagnostic and therapeutic applications and as a way of 
targeting an adenovirus vector to a specific cell type. 

The invention further discloses a wide variety of nucleic acid sequences 
and viral vectors. Thus, in one embodiment, the invention discloses a nucleic acid 
sequence encoding any one of the adenovirus fiber proteins mentioned in the 
specification, polypeptides or fragments thereof ~ including, without limitation, 
those that include deletions or other mutations; those that are chimeric; and those 
that have linkers, foreign amino acid residues, or other molecules attached for 
various purposes as disclosed herein. Nucleic acid sequences encoding various 
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other adenovirus structural and/or regulatory proteins or polypeptides are also 
within the scope of the present invention. 

In various embodiments, the adenovirus is a Group C adenovirus selected 
from serotypes 1, 2, 5 or 6; while in other embodiments, adenovirus selected from 
5 other serotypes, such as for example Ad37 (subgroup D) are useful as disclosed 

herein. 

The invention is also directed to an isolated nucleic acid molecule 
comprising an adenovirus tripartite leader (TPL) nucleotide sequence, said TPL 
nucleotide sequence comprising (a) first and second different TPL exons or (b) 

10 first, second and third same or different TPL exons, said TPL exons selected from 

the group consisting of complete TPL exon 1, partial TPL exon 1, complete TPL 
exon 2 and complete TPL exon 3 . A preferable embodiment of the invention may 
further comprise an intron operatively linked to the TPL, wherein said intron also 
contains requisite processing signals for the intron' s removal. Another preferable 

1 5 embodiment of the invention is directed to the isolated nucleic acid molecule 

wherein said TPL nucleotide sequence consists essentially of complete TPL exon 
1 operatively linked to complete TPL exon 2 operatively linked to complete TPL 
exon 3. A related embodiment may further include an intron and appropriate 
processing signals. Additional embodiments of the invention are directed to 

20 nucleic acid molecules contained in plasmids selected from the group; consisting 

of pCLF, pDV60, pDV67, pDV69, pDV80 and PDV90. Packaging cell lines and 
adenovirus particles containing the nucleic acids described above are also included 
in the invention. 

The invention is further directed to methods for producing an adenovirus 
25 vector particle containing a helper-independent fiberless recombinant adenovirus 

vector genome comprising providing a) a packaging cell line which complements 
replication and packaging of said genome and b) a helper-independent fiberless 
recombinant adenovirus vector genome which is deficient in expressing sufficient 
functional fiber protein to support assembly of fiber-containing particles. The 
30 genome is introduced into the cell line. Additional embodiments of the invention 
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may also include the following steps; a) growing the cell line produced under 
conditions for producing particles; and/or b) harvesting an adenovirus vector 
particle containing said helper-independent fiberless recombinant adenovirus 
vector genome. The method may also include a cell line that expresses a fiber 
5 protein and complements a fiber mutation in the vector. 

The invention is also directed to an adenovirus vector packaging cell line 
comprising a stably integrated nucleic acid molecule as described above, an 
operatively-linked promoter and a nucleic acid sequence which encodes an 
adenovirus structural protein, wherein said TPL sequence consists essentially of 

10 a first TPL exon operatively linked to a complete second TPL exon operatively 

linked to a complete third TPL exon. Preferably, the cell line may have a complete 
or partial first TPL exon. Another embodiment of the invention comprises 
adenovirus structural protein, such as adenovirus fiber protein or a chimeric 
protein which includes an adenovirus fiber protein tail domain. 

15 The invention is further directed to a recombinant adenovirus particle 

comprising a recombinant adenovirus vector genome wherein said genome; (a) 
does not encode or does not express sufficient adenovirus fiber protein to support 
packaging of a fiber- containing adenovirus particle without complementation of 
said fiber gene, and (b) encodes an adenovirus packaging signal and inverted 

20 terminal repeats containing adenovirus origin of replication. The invention is also 

directed to a helper-independent fiberless recombinant adenovirus vector genome 
comprising genes which (a) encode all adenovirus structural gene products but do 
not express sufficient adenovirus fiber protein to package a fiber-containing 
adenovirus particle without complementation of said fiber gene or said genome 

25 lacks at least the fibre gene and (b) encodes an exogenous protein. Either of the 

above embodiments may substitute a helper-dependent for a helper-independent 
recombinant adenovirus vector genome. In a preferable embodiment, no fiber 
protein is expressed. In yet another embodiment of the invention, the recombinant 
adenovirus particle fails to express sufficient fiber protein to allow fiber 
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incorporation into the particle such that the particle can use the fiber pathway for 
infection. 

The invention is further directed to a method for producing an adenovirus 
vector particle containing a helper-independent fiberless recombinant adenovirus 
5 vector genome, said method comprising providing a packaging cell line which 

complements replication and packaging of said genome and a helper-independent 
fiberless recombinant adenovirus vector genome which is deficient in expressing 
sufficient functional fiber protein to support assembly of fiber-containing particles 
and harvesting said particles produced by said cell line. The method may also 
1 0 comprise a step of coating (i. e. providing fiber protin in any way) to a particle with 

? an adenovirus fiber protein. In a preferable embodiment the adenovirus particle 

z comprises an exogenous protein or a modified fiber protein. 

Another aspect of the invention is directed to a method for pseudotyping 
recombinant viral vectors comprising complementing a missing fiber gene of a 
J l 5 helper-independent fiberless recombinant adenovirus vector genome by expressing 

| in packaging cells a fiber gene from a different adenoviral serotype than said 

l recombinant adenovirus vector, thereby pseudotyping said vector. An additional 

: embodiment of the invention is directed to the method for pseudotyping 

1 recombinant viral vectors comprising; a) providing a packaging cell line for 

20 propagating a fiber gene deleted recombinant adenovirus vector, b) introducing 

into said cell line a helper-independent fiberless recombinant adenovirus vector 
genome, and c) complementing the missing fiber gene by expression in the cells 
of a fiber gene from a different adenoviral serotype thereby pseudotyping the 
vector. 

25 The invention is further directed to a method for specifically targeting an 

adenovirus vector to a cell of choice comprising providing a packaging cell line 
for producing a fiber gene-deleted adenovirus vector and providing a helper- 
independent fiberless recombinant adenovirus vector genome, wherein said gene 
for a missing fiber protein is complemented with a gene for a desired modification 

30 for targeting the vector to a cell of choice. 
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The invention is farther directed to a method for producing a modified 
adenovirus comprising providing in vitro an exogenous fiber protein to a fiberless 
adenovirus. Additional embodiments of the invention may provide any 
combination of all of the following steps such that the invention be directed to a 
5 method for producing a modified adenovirus comprising; a) providing a 

packaging cell line for producing a fiberless adenovirus vector, b) introducing into 
said cell line a helper-independent fiberless or helper-dependent fiberless 
recombinant adenovirus vector genome, c) growing and harvesting a fiberless 
adenovirus, d) maintaining the fiberless adenovirus in any suitable buffer, and e) 
10 providing exogenous fiber, wherein said fiber may be a modified fiber, to the 

O fiberless adenovirus by adding conditioned media or a soluble fiber preparation or 

'E; a fiber in any suitable buffer to a virus preparation thereby producing the modified 

adenovirus. 

ifl The invention is further directed to a method for producing a modified 

f y 1 5 adenovirus comprising providing a packaging cell line for producing a helper- 

JL.. dependent fiberless adenovirus vector genome and providing a helper virus vector, 

H b wherein said cell line complements at least a deficient fiber protein gene, thereby 

\z producing the modified adenovirus. Another aspect of the invention is directed 

to a method for producing a modified adenovirus comprising: a) providing a 
20 packaging cell line for producing a fiberless adenovirus vector, b) introducing into 

said cell line a helper dependent fiberless recombinant adenovirus vector genome 
and a fiberless helper virus vector, c) growing and harvesting a fiberless 
adenovirus, and d)maintaining the fiberless adenovirus in infectious media, and e) 
providing exogenous fiber to the fiberless adenovirus by adding conditioned media 
25 or a soluble fiber preparation to a virus preparation thereby producing the 

modified adenovirus. 

Additional aspects of the invention are directed to hybrid Ad/AAV vectors 
and to new helper-dependent vectors used with fiberless adenovirus vectors. 

The invention is also directed to a method for delivering a heterologous 
30 gene to an EBV-infected B cells comprising infecting said B cells with a 



- 10- 

pseudotyped Ad5 pgal. A F particle or other fiber-deleted adenovirus particle, said 
particle having a chimeric fiber with the receptor-binding knob domain of the 
adenovirus type 3 fiber. 

The invention is also directed to an isolated nucleic acid comprising a post- 
transcriptional regulatory element (PRE) and a TPL. Preferably the PRE is the 
woodchuck hepatitis virus PRE (WPRE). 

The invention is further directed to a composition for preparing a 
therapeutic vector, said composition comprising a plasmid comprising an 
adenovirus genome lacking a nucleotide sequence encoding a fiber protein or a 
genome that is incapable of expressing sufficient fiber to result in packaging. 

Another aspect of the invention is directed to a method of delivering a 
heterologous gene to a human or any animal comprising providing an exogenous 
gene to a target cell comprising contacting said cell in vivo or ex vivo with an 
amount of a recombinant adenovirus particle sufficient to infect said cell. 

The invention is also directed to A method for producing a gutless 
adenoviral vector particle comprising: a) delivering a helper adenovirus vector 
genome to an adenovirus vector packaging cell, wherein said helper adenovirus 
vector genome lacks any gene encoding adenovirus fiber protein or lacks the 
ability to encode sufficient adenovirus fiber protein to produce an adenoviral 
vector comprising fiber protein in the absence of complemetation by said packing 
cell and wherein said packaging cell comprises the nucleic acid molecule of claim 
2 operably linked to a promoter and to an adenoviral fiber protein or to a chimeric 
protein that includes an adenovirus fiber protein tail domain; (b) delivering a 
gutless adenovirus vector genome to said packaging cell; and (c) recovering the 
gutless adenoviral vector particle produced by said cell. 

Another aspect of the invention is directed to a helper adenovirus particle 
comprising an adenovirus vector genome that does not encode or does not express 
sufficient adenovirus fiber protein to support packaging of a fiber- containing 
adenovirus particle without complementation of said fiber gene, wherein said 
genome has a mutation in its packaging sequence that renders said genome 
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substantially incapable of being packaged. Packaging sequence are those 
sequences are those sequences involved in packaging the viral particle. 

The invention is further directed to a helper adenovirus particle comprising 
an adenovirus vector genome with recombinase sites flanking its packaging 
5 sequence, wherein said vector genome does not encode or does not express 

sufficient adenovirus fiber protein to support packaging of a fiber-containing 
adenovirus particle without complementation of said fiber gene. 

The invention is also directed to an adenovirus particle comprising a 
gutless adenoviral vector genome and a fiberless capsid, as well as an adenovirus 

10 particle comprising a gutless adenoviral vector genome and a capsid comprising 

a modified fiber protein. 

Another aspect of the invention is directed to a packaging cell for the 
production of a fiberless or fiber-modified gutless adenovirus particle comprising 
an adenovirus vector complementing plasmid and a nucleotide sequence encoding 

15 a recombinase, wherein said complementing plasmid comprises the nucleic acid 

molecule of claim 2 operably linked to a promoter and to a nucleotide sequence 
encoding an adenoviral fiber protein or a chimeric adenoviral fiber protein. 
Preferably the cell line may comprise a recombinase. In an embodiment of the 
invention the recombinase may be Cre. 

20 In another embodiment of the invention, the fiber-deleted adenovirus 

vectors of the invention and the fiber-complementing adenovirus packaging cells 
of the invention are used to produce a gutless adenovirus vector particle. Such 
particle comprises a gutless adenoviral vector genome in an adenoviral capsid. 
The fiber proteins of the capsid may be wild-type fiber, or the modified fiber 

25 proteins disclosed herein. Alternatively, such particle may have a fiberless capsid 

as disclosed herein. Preferably, the gutless genome contains at least one 
heterologous gene as described herein. As used herein, the term "gutless 
adenoviral vector genome" means an adenoviral vector genome from which all of 
the viral genes have been deleted. 
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The invention also discloses systems or kits for use in any of the 
aforementioned methods. The systems or kits may contain any appropriate 
combination of the within-described vectors, plasmids, cell lines, virus particles 
and additional therapeutic agents as disclosed. Preferably, each such kit or system 
5 includes a quantity of the appropriate therapeutic substance or sequence sufficient 

for at least one administration, and instructions for administration and use. Thus, 
one system further comprises an effective amount of a therapeutic agent which 
enhances the therapeutic effect of the therapeutic viral vector- containing 
composition. Another variation discloses that the composition and the therapeutic 
10 agent are each included in a separate receptacle or container. 

*i It will also be appreciated that any combination of the preceding elements 

may also be efficacious as described herein, and that all related methods are also 
nj within the scope of the present invention. 

m Brief Description of the Drawings 

ul 5 Figure 1 is a schematic diagram of the entire adenoviral E4 transcriptional 

z~ unit with the open reading frames (ORF) indicated by blocked segments along 

£3 with the promoter and terminator sequences. The location of primers for 

amplifying specific portions of E4 are also indicated as further described in 

Example 1A. 

20 Figure 2 is a schematic map of plasmid pE4/Hygro as further described in 

Example IB. 

Figure 3 is a schematic map of plasmid pCDNA3 /Fiber as further described 
in Example IB. 

Figure 4 is a schematic map of plasmid pCLF as further described in 
25 Example IB. 

Figure 5 is a photograph of a Southern blot showing the presence of intact 
adenovirus E4 3. 1 kilobase (kb) insert in the 21 1 cell line as further described in 
Example 1C. 
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Figure 6 is an autoradiograph showing labeled fiber protein 
immunoprecipitated from cells and electrophoresed under native and denaturing 
electrophoresis conditions as described in Example 1C. The 293 cells lack fiber 
while the sublines 211 A, 21 IB and 21 1R contain fiber protein detectable in 
functional trimerized form and denatured monomeric form. 

Figure 7 is a schematic map of plasmid pDEX/El as further described in 
Example ID. 

Figure 8 is a schematic map of plasmid pEl /Fiber as further described in 
Example 1F1. 

Figure 9 is a schematic map of plasmid pE4/Fiber as further described in 
Example 1F2). 

Figure 10 is a schematic illustration of linearized pD EIBb gal delivery 
plasmids for use in cotransfection and recombination to form a recombinant 
adenoviral vector having multiple adenoviral gene deletions. The plasmids and 
recombination event are more fully described in Example 2A. 

Figure 1 1 is a schematic of plasmid p 1 1 .3 as further described in Example 
2A used in the construction of pDV44 delivery plasmid. 

Figure 12 is a schematic of plasmid 8.2. 

Figure 13 shows the trimeric structure of the recombinant fiber. 293 , 
211 A, 21 IB, or 21 1R cells as indicated were metabolically labeled with 
[ 35 S]methionine, soluble protein extracts prepared, and fiber was 
immunoprecipitated. A portion of the precipitated protein was electrophoresed 
on an 8% SDS-PAGE gel under either semi-native or denaturing conditions. The 
positions of trimeric (T) and monomeric (M) fiber are indicated. As a control for 
electrophoretic conditions, recombinant Ad2 fiber produced in baculovirus- 
infected cells was run under identical conditions and stained with Coomassie blue. 

Figure 14 shows the complementation of a fiber mutant adenovirus by 
fiber-producing cells. The cell lines indicated (2x1 0 6 cells per sample) were 
infected with the temperature-sensitive fiber mutant adenovirus H5M42 at 10 
PFU/cell and incubated at either the permissive (32.5 °C, stippled bars) or the 
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restrictive (39.5 °C, solid bars) temperature. 48 hours post-infection, virus was 
isolated by freeze-thaw lysis and yields determined by fluorescent focus assay on 
SW480 cells. Each value represents the mean of duplicate samples, and the data 
shown is representative of multiple experiments. 
5 Figure 1 5 shows the incorporation of the recombinant Ad5 fiber into Ad3 

particles. In Figure 15 A, the alignment of the N-terminal (penton base-binding) 
domains of fiber proteins from several different adenovirus serotypes is shown. 
From top to bottom, the five different serotypes are listed as SEQ ID NOs 21-25. 
In Figure 15B, type 3 adenovirus was propagated in 293, 21 IB, or 21 1R cells as 

10 indicated and purified by two sequential CsCl centrifiigations. 10 jig of the 

purified viral particles was then electrophoresed under denaturing conditions and 
transferred to a P VDF membrane. Ad5 fiber was detected with a polyclonal rabbit 
antibody raised against recombinant Ad2 fiber. As a positive control for detection, 
400 ng of wild-type Ad2 was run in the lane marked "Ad2". Under these 

1 5 conditions, the mobilities of the Ad2 and Ad5 fibers are indistinguishable and the 

antibody reacts with both proteins. 

Figure 1 6 shows the fiber deletion in pDV44 and the genomic structures 
oftheAd5.pgal.AF and Ad5.Pgal.wt vectors: Figure 1 6 A shows pD V44 that was 
constructed by removing the fiber gene and residual E3 sequences (nt 

20 30819:32743 of AD5) from pBHGl 0. Figures 16B shows viruses constructed by 

cotransfection of either pBHGlO or pDV44 with pAElBPgal. Both are E1/E3 
deleted Ad5 vectors, and Ad5.Pgal.AF has the additional fiber (L5) deletion as in 
pDV44. 

Figure 17 shows the analysis of the viral chromosomes. Figure 17A 
25 shows the predicted EcoRI restriction maps of Ad5.Pgal.wt and Ad5.Pgal.AF. 

The 5 . 9 kb fragment at the right end of the Ad5 . Pgal. wt genome is reduced to 4.0 
kb by the deletion of fiber sequences in Ad5.Pgal.AF. Figure 17B shows an 
ethidium bromide-stained gel of EcoRI-digested viral DNA. Figure 17C shows 
a Southern blot of the gel as described in Example 2 probed either with labeled 
30 fiber or E4 sequences. 
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Figure 1 8 shows the analysis of vertex proteins in the viral particles. 293 
(non-fiber expressing) or 21 IB (fiber-expressing) cells were infected with 
Ad5.pgal.wt ('wt') or with Ad5.pgal.AF ('AF') and the resulting viral particles 
were purified on CsCl gradients. 10 (ig of purified virions was then 
electrophoresed on 5-16% gradient gels and Western blotted. Proteins were 
detected with polyclonal anti-fiber or anti-penton base antibodies. 

Figure 1 9 shows the infectivity of Ad particles on THP-1 monocytic cells. 
Figure 19A shows THP-1 cells that were infected with Ad5.Pgal.wt or with 
fiberless Ad5 . Pgal. AF at 1 00,000 particles/cell. Forty-eight hours after infection, 
cells were fixed and stained with X-gal and the fraction of infected cells was 
determined by light microscopy. Figure 19B shows cells that were infected with 
1000 particles per cell of Ad5.pgal.wt or with 100,000 particles/cell of 
Ad5.Pgal.AF. As indicated, cells were pretreated with 100 |ig/ml of recombinant 
penton base or with 20 jig/ml of recombinant Ad2 fiber. 

Figure 20 shows a schematic of improved fiber-complementing cell lines, 
633 and 644 as further described in the Examples. 

Figures 21 and 22 illustrates pseudotyping of fiberless particles with fiber 
proteins and infectivity data as further described in the Examples. 

Figure 23 shows the Clal to Bglll fragment of Ad5. 

Figure 24 shows the plasmid pGRE5-2/EBV 

Figure 25 shows the plasmid pGRE5-El. 

Figure 26 shows the plasmid pSE280-E2 BamHI-SmaL 

Figure 27. The fiber-deleted adenovirus vector Ad5.Pgal.AF was grown 
in cells expressing either no fiber (293; £ Ad5.pgaLAF/0'), the Ad5 fiber (633; 
£ Ad5.pgal. AF/5F'), or the Ad37 fiber with modifications as described in the text 
(705; £ Ad5.pgal.AF/37F')andCsCl-purified. 10 pg of the purified particles were 
electrophoresed and transferred to a nylon membrane. As controls, 10 jig of 
wild-type Ad37 or the fiber gene-containing vector Ad5.pgal.wt or a sample of 
purified recombinant Ad37 fiber knob were also run. The blot was probed with 
polyclonal antisera against recombinant Ad37 fiber or Ad2 fiber proteins. As a 
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loading control, the same filter was reprobed with an antibody against the Ad2 
penton base (the anti-Ad2 sera cross-recognized the very similar Ad5 fiber and 
Ad5 penton base proteins). 

Figure 28 shows PCR analysis for fiber presence. 

Figure 29 shows the transduction efficiency for fiberless virus with and 
without soluble fiber. 

Figure 30 shows the transduction efficiency of ADSBgF" onHDF cell line 
with the presence of different amounts of 633 conditioned media. 

Detailed Description of the Invention 

To reduce the frequency of contamination with wild-type adenovirus, it is 
considered desirable to improve either the viral vector or the cell line to reduce the 
probability of recombination. For example, an adenovirus from a group with less 
homology to the group C viruses may be used to engineer recombinant viruses 
with little propensity for recombination with the Ad5 sequence in 293 cells. 
Similarly, an epithelial cell line - e.g. the cell line known as 293 ~ may be used or 
further modified according to within-disclosed methods which stably expresses 
adenovirus proteins or polypeptides from Ad3 and/or proteins or polypeptides 
from another non-group-C or group C serotype; such a cell line would be useful 
to support adenovirus-derived viral vectors bearing deletions of regulatory and/or 
structural genes, irrespective of the serotype from which such a vector was 
derived. 

It is also contemplated that the constructs and methods of the present 
invention will support the design and engineering of chimeric viral vectors which 
express amino acid residue sequences derived from two or more Ad serotypes. 
Thus, unlike methods and constructs available prior to the advent of the present 
disclosure, this invention allows the greatest possible flexibility in the design and 
preparation of useful viral vectors and cell lines which support their construction 
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and propagation ~ all with a decreased risk of recombining with wild-type Ad to 
produce potentially-harmful recombinants. 

In part, the present invention discloses a simpler, alternative means of 
reducing the recombination between viral and cellular sequences than those 
5 discussed in the art. One such means is to increase the size of the deletion in the 

recombinant virus and thereby reduce the extent of shared sequences between that 
virus and any Ad genes present in a packaging cell line ~ e.g., the Ad5 genes in 
293 cells, or the various Ad genes in the novel cell lines of the present invention. 
Therefore, the present invention makes it feasible to engineer and produce 
1 0 novel viral vectors that are able to package and deliver significantly larger foreign 

C3 nucleic acid sequences for efficacious use in a variety of therapeutic applications, 

without endangering the subject to whom they are administered, due to their 
impaired ability to self-replicate in non-complementing cell lines. Due to the fact 
p that "helper" viruses or plasmids need not be used in conjunction with many of the 

r ;"|15 viral vectors of the present invention, those vectors of the present invention are 

!L also simpler to use than those previously described in the art. 

L Definitions 



In order to provide a clearer understanding of the specification and claims, 
the following definitions are provided. 

20 Adenoviral Vector or Ad-Derived Vector. Any adenovirus-derived 

plasmid, genome or virus into which a foreign DNA may be inserted or expressed. 
This term may also be used interchangeably with "viral vector." This "type" of 
vector may be utilized to carry nucleotide sequences encoding therapeutic proteins 
or polypeptides to specific cells or cell types in a subject in need of treatment, as 

25 described further herein below. 

Amino Acid Residue'. An amino acid formed upon chemical digestion 
(hydrolysis) of a polypeptide at its peptide linkages. The amino acid residues 
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described herein are preferably in the "L" isomeric form. However, residues in the 
"D" isomeric form can be substituted for any L-amino acid residue, as long as the 
desired functional property is retained by the polypeptide. NH 2 refers to the free 
amino group present at the amino terminus of a polypeptide. COOH refers to the 
5 free carboxy group present at the carboxyl terminus of a polypeptide. In keeping 

with standard polypeptide nomenclature described in J. Biol Chem., 243:3552-59 
(1969) and adopted at 37 C.F.R. §§ 1.821 - 1.822, abbreviations for amino acid 
residues are shown in the following Table of Correspondence: 



Table of Correspondence 



3io 


SYMBOL 


AMINO ACID 




1 -Letter 


3-Letter 






Y 


Tyr 


tyrosine 




G 


Gly 


glycine 


- „,. 


F 


Phe 


phenylalanine 


"lis 


M 


Met 


methionine 




A 


Ala 


alanine 




S 


Ser 


serine 




I 


lie 


isoleucine 




L 


Leu 


leucine 


20 


T 


Thr 


threonine 




V 


Val 


valine 




P 


Pro 


proline 




K 


Lys 


lysine 




H 


His 


histidine 


25 


Q 


Gin 


glutamine 




E 


Glu 


glutamic acid 




Z 


Glx 


Glu and/or Gin 




W 


Trp 


tryptophan 
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R 


Are 


arginine 


D 


Asp 


aspartic acid 


N 


Asn 


asparagine 


B 


Asx 


Asn and/or Asp 


C 


Cys 


cysteine 


X 


Xaa 


Unknown or other 



Table 1 



It should be noted that all amino acid residue sequences represented herein by 
formulae have a left to right orientation in the conventional direction of 
-1 0 amino-terminus to carboxyl-terminus. In addition, the phrase "amino acid residue" 

jj is broadly defined to include the amino acids listed in the Table of Correspondence 

: 1 and modified and unusual amino acids, such as those referred to in 37 C.F.R. § § 

yi 1.821-1.822, and incorporated herein by reference. Furthermore, it should be 

3 noted that a dash at the beginning or end of an amino acid residue sequence 

A 5 indicates a peptide bond to a further sequence of one or more amino acid residues 

or to an amino -terminal group such as NH 2 or to a carboxyl-terminal group such 
3 as COOH. 

Complementing Plasmid: This term is generally used herein to describe 
plasmid vectors used to deliver particular nucleotide sequences into a packaging 
20 cell line, with the intent of having said sequences stably integrate into the cellular 

genome. 

Delivery Plasmid. This term is generally used herein to describe a plasmid 
vector that carries or delivers nucleotide sequences in or into a cell line (e.g., a 
packaging cell line) for the purpose of propagating therapeutic viral vectors of the 
25 present invention. 

DNA Homolog: A nucleic acid having a preselected conserved nucleotide 
sequence and a sequence encoding a preferred polypeptide according to the 
present invention, where the nucleic acid is substantial homologous to a named 
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preferred embodiment. By the term "substantially homologous" is meant having 
at least 80%, preferably at least 90%, most preferably at least 95% homology 
therewith. 

The terms "homology" and "identity" are often used interchangeably. 
In this regard, percent homology or identity may be determined, for example, 
by comparing sequence information using a GAP computer program. The GAP 
program utilizes the alignment method of Needleman and Wunsch (/. Mol. 
Biol. 48:443 (1970), as revised by Smith and Waterman (Adv. Appl. Math. 
2:482 (1981). Briefly, the GAP program defines similarity as the number of 
aligned symbols (i.e., nucleotides or amino acids) which are similar, divided 
by the total number of symbols in the shorter of the two sequences. The 
preferred default parameters for the GAP program may include: (1) a unary 
comparison matrix (containing a value of 1 for identities and 0 for 
non- identities) and the weighted comparison matrix of Gribskov and Burgess, 
Nucl. Acids Res. 14:6745 (1986), as described by Schwartz and Dayhoff, eds., 
ATLAS OF PROTEIN SEQUENCE AND STRUCTURE, National Biomedical 
Research Foundation, pp. 353-358 (1979); (2) apenalty of 3.0 for each gap and 
an additional 0. 10 penalty for each symbol in each gap; and (3) no penalty for 
end gaps. 

Whether any two nucleic acid molecules have nucleotide sequences that 
are at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% "identical" can be 
determined using known computer algorithms such as the "FAST A" program, 
using for example, the default parameters as in Pearson and Lipman, Proc. Natl 
Acad. Sci. USA 55:2444 (1988). Alternatively the BLAST function of the 
National Center for Biotechnology Information database may be used to determine 
identity 

In general, sequences are aligned so that the highest order match is 
obtained. "Identity" per se has an art-recognized meaning and can be calculated 
using published techniques. (See, e.g.: Computational Molecular Biology, Lesk, 
A.M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics 
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and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; 
Computer Analysis of Sequence Data, Part I, Griffin, A.M., and Griffin, H.G., 
eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology \ 
vonHeinje, G., Academic Press, 1987; and Sequence Analysis Primer \ Gribskov, 
M. and Devereux, J., eds., M Stockton Press, New York, 1991). While there 
exist a number of methods to measure identity between two polynucleotide or 
polypeptide sequences, the term "identity" is well known to skilled artisans 
(Carillo, H. & Lipton, D., SIAM J Applied Math 45:1073 (1988)). Methods 
commonly employed to determine identity or similarity between two sequences 
include, but are not limited to, those disclosed in Guide to Huge Computers, 
Martin J. Bishop, ed., Academic Press, San Diego, 1994, and Carillo, H. & 
Lipton, D., SIAM J Applied Math 48A013 (1988). Methods to determine identity 
and similarity are codified in computer programs. Preferred computer program 
methods to determine identity and similarity between two sequences include, but 
are not limited to, GCG program package (Devereux, J., et al, Nucleic Acids 
Research 12(I):3S7 (1984)), BLASTP, BLASTN, FASTA (Atschul, S.F., et al y 
JMolec Biol 275:403 (1990)). 

Therefore, as used herein, the term "identity" represents a comparison 
between a test and a reference polypeptide or polynucleotide. For example, a test 
polypeptide may be defined as any polypeptide that is 90% or more identical to a 
reference polypeptide. As used herein, the term at least "90% identical to" refers 
to percent identities from 90 to 99.99 relative to the reference polypeptides. 
Identity at a level of 90% or more is indicative of the fact that, assuming for 
exemplification purposes a test and reference polynucleotide length of 100 amino 
acids are compared. No more than 10% (i.e., 10 out of 100) amino acids in the 
test polypeptide differs from that of the reference polypeptides. Similar 
comparisons may be made between a test and reference polynucleotides. Such 
differences may be represented as point mutations randomly distributed over the 
entire length of an amino acid sequence or they may be clustered in one or more 
locations of varying length up to the maximum allowable, e.g. 10/100 amino acid 
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difference (approximately 90% identity). Differences are defined as nucleic acid 
or amino acid substitutions, or deletions. 

An embodiment of the invention may use polynucleotides at least 90% or 
95% identical to those encoding the TPL nucleic acid sequences. A further 
embodiment of the invention may include those polynucleotides that encode a 
polypeptide of interest that are at least 95% identical when the variation in such 
a polynucleotide is due to more than merely degenerate changes. 

Expression or Delivery Vector: Any plasmid or virus into which a foreign 
DNA may be inserted for expression in a suitable host cell -- i.e., the protein or 
polypeptide encoded by the DNA is synthesized in the host cell's system. Vectors 
capable of directing the expression of DNA segments (genes) encoding one or 
more proteins are referred to herein as "expression vectors." Also included are 
vectors which allow cloning of cDNA (complementary DNA) from mRNAs 
produced using reverse transcriptase. 

Foreign Gene: This term is used to identify a DNA molecule not present 
in the exact orientation and position as the counterpart DNA molecule found in 
wild-type adenovirus. It may also refer to a DNA molecule from another 
organism or species (/.e., exogenous) or from another Ad serotype. 

Gene: A nucleic acid whose nucleotide sequence encodes an RNA or 
polypeptide. A gene can be either RNA or DNA. Genes may include regions 
preceding and following the coding region (leader and trailer) as well as 
intervening sequences (introns) between individual coding segments (exons). 

Isolated: This term is used to indicate a nucleic acid or polypeptide 
sequence separated from the genetic environment from which the sequences were 
obtained. It may also mean altered from the natural state. For example, a 
polynucleotide or a polypeptide naturally present in a living animal is not 
"isolated," but the same polynucleotide or polypeptide separated from the 
coexisting materials of its natural state is "isolated", as the term is employed 
herein. Thus, a polypeptide or polynucleotide produced and/or contained within 
a recombinant host cell is considered isolated for purposes of the present 
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invention. Also intended as an "isolated polypeptide" or an "isolated 
polynucleotide" are polypeptides or polynucleotides that have been purified, 
partially or substantially, from a recombinant host cell or from a native source. 
For example, a recombinantly produced version of a compounds can be 
substantially purified by the one-step method described in Smith and Johnson, 
Gene <57;31-40 (1988). The terms isolated and purified are sometimes used 
interchangeably. 

By "isolated" is meant that the DNA is free of the coding sequences of 
those genes that, in the naturally-occurring genome of the organism (if any) from 
which the DNA of the invention is derived, immediately flank the gene encoding 
the DNA of the invention. The isolated DNA may be single-stranded or 
double-stranded, and may be genomic DNA, cDNA, recombinant hybrid DNA, 
or synthetic DNA. It may be identical to a native DNA sequence, or may differ 
from such sequence by the deletion, addition, or substitution of one or more 
nucleotides. 

Isolated or purified as it refers to preparations made from biological cells 
or hosts should be understood to mean any cell extract containing the indicated 
DNA or protein including a crude extract of the DNA or protein of interest. For 
example, in the case of a protein, a purified preparation can be obtained following 
an individual technique or a series of preparative or biochemical techniques and 
the DNA or protein of interest can be present at various degrees of purity in these 
preparations. The procedures may include for example, but are not limited to, 
ammonium sulfate fractionation, gel filtration, ion exchange change 
chromatography, affinity chromatography, density gradient centrifugation and 
electrophoresis. 

A preparation of DNA or protein that is "pure" or "isolated" should be 
understood to mean a preparation free from naturally occurring materials with 
which such DNA or protein is normally associated in nature. "Essentially pure" 
should be understood to mean a "highly" purified preparation that contains at least 
95% of the DNA or protein of interest. 
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A cell extract that contains the DNA or protein of interest should be 
understood to mean a homogenate preparation or cell-free preparation obtained 
from cells that express the protein or contain the DNA of interest. The term "cell 
extract" is intended to include culture media, especially spent culture media from 
which the cells have been removed. 

Packaging Cell line: A packaging cell line is a cell line that provides a 
missing gene product or its equivalent. 

Particle: The adenovirus (Ad) particle is relatively complex and may be 
resolved into various substructures. The particle is the minimal structural or 
functional unit of a virus. A virus can refer to a single particle, a stock of particles 
or a viral genome. 

Penton: The terms "penton" or "penton complex" are preferentially used 
herein to designate a complex of penton base and fiber. The term "penton" may 
also be used to indicate penton base, as well as penton complex. The meaning of 
the term "penton" alone should be clear from the context within which it is used. 

Plasmid: An autonomous self-replicating extrachromosomal circular DNA 

Post-transcription Regulatory Element (PRE) is a regulatory element 
found in viral or cellular messenger RNA that is not spliced, i.e. intronless 
messages. Examples include, but are not limited to, human hepatitis virus, 
woodchuck hepatitits virus, the TK gene and mouse histone gene. The PRE may 
be placed before a polyA sequence and after a heterologous DNA sequence. 

Pseudotyping: This term as generally used herein describes the 
production of adenoviral vectors having modified capsid protein or capsid proteins 
from a different serotype than the serotype of the vector itself. One example, is 
the production of an adenovirus 5 vector particle containing a chimeric Ad3/Ad5 
fiber protein. This may be accomplished by producing the adenoviral vector in 
packaging cell lines expressing different fiber proteins. 

Promoter: Useful promoters according to the present invention may be 
inducible or constitutive. Inducible promoters will initiate transcription only in the 
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presence of an additional molecule; constitutive promoters, on the other hand, do 
not require the presence of any additional molecule to regulate gene expression. 
A regulatable or inducible promoter may also be described as a promoter wherein 
the rate of RNA polymerase binding and initiation is modulated by external 
5 stimuli. Such stimuli include various compounds or compositions, light, heat, 

stress, chemical energy sources, and the like. Inducible, suppressible and 
repressible promoters are considered regulatable promoters. 

Receptor: Receptor is a term used herein to indicate a biologically active 
molecule that specifically binds to (or with) other molecules. The term "receptor 
10 protein" may be used to more specifically indicate the proteinaceous nature of a 

l: i specific receptor. 

^ Recombinant: As used herein, the term is intended to refer to any progeny 

fij formed as the result of genetic engineering. This may also be used to describe a 

Y : l m virus formed by recombination of plasmids in a packaging cell. 

HJ15 Transgene or Therapeutic Nucleotide Sequence: As described and 

O claimed herein, such a sequence includes DNA and RNA sequences encoding an 

j = RNA or polypeptide. Such sequences may be "native" or naturally-derived 

:: ; sequences; they may also be "non-native" or "foreign" sequences which are 

q naturally- or recombinantly-derived. The term "transgene," which may be used 

20 interchangeably herein with the term "therapeutic nucleotide sequence," is often 

used to describe a heterologous or foreign (exogenous) gene that is carried by a 

viral vector and transduced into a host cell. 

Therefore, therapeutic nucleotide sequences may also include antisense 

sequences or nucleotide sequences which may be transcribed into antisense 
25 sequences. Therapeutic nucleotide sequences (or transgenes) further comprise 

sequences which function to produce a desired effect in the cell or cell nucleus into 

which said therapeutic sequences are delivered. For example, a therapeutic 

nucleotide sequence may encode a functional protein intended for delivery into a 

cell which is unable to produce that functional protein. 
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II* Adenovirus 

Fiber plays a crucial role in adenovirus infection by attaching the virus to 
a specific receptor on the cell surface. The fiber is an elongated protein which 
exists as a trimer of three identical polypeptides (polypeptide IV) of 582 amino 
5 acids in length. An adenovirus fiber consists of three domains: an N-terminal tail 

domain that interacts with penton base; a shaft composed of variable numbers of 
repeats of a 15-amino-acid segment that forms beta-sheet and beta-bends; and a 
knob at the C-terminus ("head domain") that contains the type-specific antigen 
and is responsible for binding to the cell surface receptor. The gene encoding the 

10 fiber protein from Ad2 has been expressed in human cells and has been shown to 

be correctly assembled into trimers, glycosylated and transported to the nucleus. 
(See, e.g., Hong and Engler, Virology 185: 758-761, 1991). Thus, alteration of 
the fiber in recombinant Ad vectors can lead to alteration in gene delivery. This 
has great utility for a variety of gene therapy applications and is one of the objects 

15 of the present invention. 

Hexon, penton and fiber capsomeres are the major components on the 
surface of the virion. Their constituent polypeptides, nos. II, III and IV, contain 
tyrosine residues that are exposed on the surface of the virion and can be labeled — 
e.g., by iodination of intact particles. 

20 The 35,000+ base pair (bp) genome of adenovirus type 2 has been 

sequenced and the predicted amino acid sequences of the major coat proteins 
(hexon, fiber and penton base) have been described. (See, e.g., Neumann et al, 
Gene 69: 153-157 (1988); Herisse etal., Nuc. Acids Res. 9: 4023-4041 (1981); 
Roberts et aL, J. Biol Chem. 259: 13968-13975 (1984); Kinloch et aL, J. Biol. 

25 Chem. 259: 6431-6436 (1984); and Chroboczek et al. y Virol 161: 549-554, 

1987). 

The sequence of Ad5 DNA was completed more recently; its sequence 
includes a total of 35,935 bp. Portions of many other adenovirus genomes have 
also been sequenced. It is presently understood that the upper packaging limit for 
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adenovirus virions is about 105% of the wild-type genome length. (See, e.g., 
Bett, etaL, J. Virol 67(10): 5911-21, 1993). Thus, for Ad2 and Ad5, this would 
be an upper packaging limit of about 38kb of DNA. 

Adenovirus DNA also includes inverted terminal repeat sequences (ITRs) 
ranging in size from about 100 to 150 bp, depending on the serotype. The 
inverted repeats enable single strands of viral DNA to circularize by base-pairing 
of their terminal sequences, and the resulting base-paired "panhandle" structures 
required for replication of the viral DNA. 

For efficient packaging, the ITRs and the packaging signal (a few hundred 
bp in length) comprise the "minimum requirement" for replication and packaging 
of a genomic nucleic acid into an adenovirus particle. Helper-dependent vectors 
lacking all viral ORFs but including these essential cis elements (the ITRs and 
contiguous packaging sequence) have been constructed, but the virions package 
less efficiently that the helper and package as multimers part of the time, which 
suggests that the virus may "want" to package a fuller DNA complement (see, 
e.g., Fisher, etaL, Virology 217: 11-22, 1996). 

The viral vectors of the present invention may retain their ability to express 
the genome packaged within — i.e., they may retain their "infectivity" — they do 
not act as infectious agents, however, to the extent that they cause disease in the 
subjects to whom they are administered for therapeutic purposes. 

It is to be appreciated that Ad vectors have several distinct advantages 
over other viral vectors in the art. For example, recombination of such vectors is 
rare; there are no known associations of human malignancies with adenoviral 
infections despite common human infection with adenoviruses; the genome may 
be manipulated to accommodate foreign genes of a fairly substantial size; and host 
proliferation is not required for expression of adenoviral proteins. 

An extension of this invention is that the Ad-derived viral vectors disclosed 
herein may be used to target and deliver genes into specific cells by incorporating 
the attachment sequence for other receptors (such as CD4) onto the fiber protein 
by recombinant DNA techniques, thus producing a chimeric molecule. This 
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should result in the ability to target and deliver genes into a wide range of cell 
types with the advantage of evading recognition by the host's immune system. The 
within-disclosed delivery systems thus provide for increased flexibility in gene 
design to enable gene delivery into proliferating and nonproliferating cell types. 

For example, U.S. Patent Nos. 5,756,086, and 5,543,328 as well as 
W095/26412 and WO 98/44121 and Krasnykh, et al (J. Virol 70: 6839-46, 
1996), the disclosures of which are incorporated by reference herein, describe 
modifications that may be made to the adenovirus fiber protein. Such 
modifications are useful in altering the targeting mechanism and specificity of 
adenovirus and could readily be utilized in conjunction with the constructs of the 
present invention to target the novel viral vectors disclosed herein to different 
receptors and different cells. Moreover, modifications to fiber protein which alter 
its tropism may permit greater control over the localization of viral vectors in 
therapeutic applications. 

Similarly, incorporation of various structural proteins into cell lines of the 
present invention, whether or not those proteins are modified, is also contemplated 
by the present invention. Thus, for example, modified penton base polypeptides 
such as those described in Wickham, et al (J. Virol 70: 683 1-8, 1996) may have 
therapeutic utility when used according to the within-disclosed methods. 

While some of the Examples appearing below specifically recite fiber 
proteins, polypeptides, and fragments thereof, it is expressly provided herein that 
other structural and non-structural Ad proteins and polypeptides (e.g. , regulatory 
proteins and polypeptides) may be used as components of the various disclosed 
vectors and cell lines. Moreover, chimeric molecules comprised of proteins, 
polypeptides, and/or fragments thereof which are derived from different Ad 
serotypes may be used in any of the within-disclosed methods, constructs and 
compositions. Similarly, recombinant DNA sequences of the present invention 
may be prepared using nucleic acid sequences derived from different Ad serotypes, 
in order to design useful constructs with broad applicability, as disclosed and 
claimed herein. 
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It should also be appreciated that, while the members of Group C or 
Group D adenovirus -i.e., Ad serotypes 1, 2, 5, 6 or 37 -- are specifically recited 
in various examples herein, the present invention is in no way limited to those 
serotypes alone. In view of the fact that the adenovirus serotypes are all closely- 
5 related in structure and functionality, therapeutic viral vectors, packaging cell 

lines, and plasmids of the present invention may be constructed from components 
of any and all Ad serotypes — and the within-disclosed methods of making and 
using the various constructs and cell lines of the present invention apply to all of 
said serotypes. 

1 0 The family of Adenoviridae includes many members with at least 47 known 

% serotypes of human adenovirus (Adl-Ad47) (Shenk, Virology^ Chapter 67, in 

^ Fields etal, eds. Lippincott-Raven, Philadelphia, 1 996,) as well as members of the 

flj genus Mastadenovirus including human, simian, bovine, equine, porcine, ovine, 

canine and opossum viruses, and members of the Aviadenovirus genus, including 
1 y l 5 bird viruses, e.g. CELO. Thus it is contemplated that the disclosed inventions can 

□ be applied to any adenovirus species, and the invention need not be so limited, 

y ; One of skill in the art would have knowledge of the different adenoviruses as 

;i evidenced by (Shenk, Virology^ Chapter 67, in Fields et aL, eds. Lippincott- 

C3 Raven, Philadelphia, 1996,) which is herein incorporated by reference. 

20 III. Packaging Cell Lines 

A. Adenovirus Complentation Genetics 

The first generation of recombinant adenoviral vectors currently available 
typically have a deletion in the first viral early gene region which is generally 
referred to as El , which comprises the El a and Elb regions. (These regions span 
25 genetic map units 1.30 to 9.24.) Figure 3 in chapter 67 of Fields Virology, 3d 

Ed. (Fields et al eds, Lippincott-Raven Publ., Philadelphia, 1996, p. 2116) 
illustrates a transcription and translation map of adenovirus type 2 (Ad2). 
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Deletion of the viral El region renders the recombinant adenovirus 
defective for replication and incapable of producing infectious viral particles in 
subsequently-infected target cells. Thus, to generate El -deleted adenovirus 
genome replication and to produce virus particles requires a system of 
complementation which provides the missing El gene product. El 
complementation is typically provided by a cell line expressing El, such as the 
human embryonic kidney packaging cell line, i.e. an epithelial cell line, called 293 . 
Cell line 293 contains the El region of adenovirus, which provides El gene 
region products to "support" the growth of El -deleted virus in the cell line (see, 
e.g., Graham et al. t J. Gen. Virol 36: 59-71, 1977). Additionally, cell lines that 
may be usable for production of defective adenovirus having a portion of the 
adenovirus E4 region have been reported (WO 96/22378). Multiply deficient 
adenoviral vectors and complementing cell lines have also been described (WO 
95/34671, U.S. Patent No. 5,994,106). Nevertheless, inherent problems exist 
concerning first-generation recombinant adenoviruses. 

B. Adenovirus Particle Packaging Cell Lines 

Packaging cell lines disclosed herein support viral vectors with deletions 
of major portions of the viral genome, without the need for helper viruses. 
Additionally, the invention provides novel cell lines and helper viruses for use with 
helper-dependent vectors. 

Thus, in one embodiment of the present invention, a packaging cell line is 
disclosed having DNA sequences stably integrated into the cellular genome 
wherein the DNA sequences encode one or more adenovirus regulatory and/or 
structural polypeptides which complement the genes deleted or mutated in the 
adenovirus vector genome to be replicated and packaged. 

In another embodiment, the packaging cell line expresses one or more 
adenovirus structural proteins, polypeptides, or fragments thereof, wherein said 
structural protein is selected from the group consisting of penton base, hexon, 
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fiber, polypeptide Ilia, polypeptide V, polypeptide VI, polypeptide VII, 
polypeptide VIII, and biologically active fragments thereof 

In one variation, the sequences are constitutively expressed; in another, 
one or more sequences is under the control of a regulatable promoter. In a 
preferred embodiment expression is constitutive. In various preferred 
embodiments, the polypeptides expressed by the DNA sequences are biologically 
active. 

In a further and preferred embodiment the packaging cell line of the 
present invention supports the production of a viral vector. In a preferred 
embodiment the viral vector is a therapeutic vector. 

The present invention also discloses a packaging cell line which 
complements a viral vector having a deletion or mutation of a DNA sequence 
encoding an adenovirus structural protein, regulatory polypeptides El A and E1B, 
and/or one or more of the following regulatory proteins or polypeptides: E2A, 
E2B, E3, E4, L4, or fragments thereof 

Various useful packaging cells are contemplated which complement 
adenovirus. In one aspect of the present invention, each DNA sequence is 
introduced into the genome of the within-disclosed cell lines via a separate 
complementing plasmid. In other embodiments, two or more DNA sequences 
were introduced into the genome via a single complementing plasmid. In one 
variation, the complementing plasmid comprises a DNA sequence encoding 
adenovirus fiber protein, polypeptide or fragment thereof. An example of a useful 
complementing plasmid according to the present invention is a plasmid having the 
characteristics of pCLF (for deposit details, see Example 3) 

One embodiment discloses a packaging cell useful in the preparation of 
recombinant adenovirus viral vectors comprising a delivery plasmid comprising an 
adenovirus genome lacking a nucleotide sequence encoding fiber. In one 
variation, the delivery plasmid further comprises a nucleotide sequence encoding 
a foreign polypeptide. A preferred delivery plasmid is pDV44, pElB gal, or 
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pElsplB. In another variation, the cell further comprises a complementing 
plasmid containing a nucleotide sequence encoding fiber, the plasmid being stably 
integrated into the cellular genome of the cell. 

In one embodiment, a composition comprises a cell containing first and 
second delivery plasmids wherein a first delivery plasmid comprises an adenovirus 
genome lacking a nucleotide sequence encoding fiber and incapable of directing 
the packaging of new viral particles in the absence of a second delivery plasmid, 
and a second delivery plasmid comprises an adenoviral genome capable of 
directing the packaging of new viral particles in the presence of the first delivery 
plasmid. 

In another variation, the first and second delivery plasmids interact within 
the cell to produce a therapeutic viral vector. In yet another variation, the cell 
further comprises a complementing plasmid containing a nucleotide sequence 
encoding fiber, the plasmid being stably integrated into the cellular genome of the 
cell In still another, the first or second delivery plasmid further comprises a 
nucleotide sequence encoding a foreign polypeptide. In various embodiments, the 
polypeptide is a therapeutic molecule. 

Another embodiment discloses a composition as before, wherein the first 
delivery plasmid lacks adenovirus packaging signal sequences. In another aspect, 
the second delivery plasmid contains a LacZ reporter construct. In another 
variation, the second delivery plasmid further lacks a nucleotide sequence 
encoding an adenovirus regulatory protein. In one variation, the regulatory 
protein is El. In one embodiment of the above-noted compositions, the 
complementing plasmid has the characteristics of pCLF. 

In another embodiment, a composition is disclosed wherein the first 
delivery plasmid lacks a nucleotide sequence encoding an adenovirus structural 
protein and the second delivery plasmid lacks a nucleotide sequence encoding 
adenovirus El protein. In another, the first delivery plasmid lacks a nucleotide 
sequence encoding adenovirus E4 protein and the second delivery plasmid lacks 
a nucleotide sequence encoding adenovirus El protein. In still another, the cell 
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contains at least one complementing plasmid encoding an adenoviral regulatory 
protein and a structural protein. 

In one preferred variation of the present invention, a packaging cell line 
expresses fiber protein. In one embodiment, the fiber protein has been modified 
5 to include a non-native amino acid residue sequence which targets a specific 

receptor, but which does not disrupt trimer formation or transport of fiber into the 
nucleus. In another variation, the non-native amino acid residue sequence alters 
the binding specificity of the fiber for a targeted cell type. In still another 
embodiment, the structural protein is fiber comprising amino acid residue 
10 sequences from more than one adenovirus serotype. As disclosed herein, the 

rijj nucleotide sequences encoding fiber protein or polypeptide need not be modified 

" r solely at one or both termini; fiber protein — and indeed, any of the adenovirus 

: j structural proteins, as taught herein ~ may be modified "internally" as well as at 

Pi the termini. 

1 "1 5 In one variation, the non-native amino acid residue sequence is coupled to 

O the carboxyl terminus of the fiber. In yet another, the non-native amino acid 

M= residue sequence further includes a linker sequence. Alternatively, the fiber 

protein further comprises a ligand coupled to the linker. A suitable ligand may be 
O selected from the group consisting of ligands that specifically bind to a cell surface 

20 receptor and ligands that can be used to couple other proteins or nucleic acid 

molecules. Typically, any of the packaging cell lines of this invention may have 
a DNA sequence encoding all or part of a fiber protein - including modified or 
chimeric proteins ~ stably integrated into the genome. 

In various aspects of the present invention, a packaging cell line of the 
25 present invention is derived from a procaryotic cell line; in another, it is derived 

from a eucaryotic cell line. While various embodiments suggest the use of 
mammalian cells, and more particularly, epithelial cell lines, a variety of other, 
non-epithelial cell lines are used in various embodiments. Thus, while various 
embodiments disclose the use of a cell line selected from the group consisting of 



-34- 

293, A549, W162, HeLa, Vera, 211, and 211 A cell lines, it is understood that 
various other cell lines are likewise contemplated for use as disclosed herein. 

IV. Therapeutic Viral Vectors and Related Systems 

A. Nucleic Acid Segments 

5 A therapeutic viral vector or composition of the present invention 

comprises a nucleotide sequence, nucleic acid molecule or segment as described 
herein. Typically, the nucleic acid molecule or molecule encodes a protein or 
U polypeptide molecule — or a biologically active fragment thereof — which may be 

used for therapeutic applications. A nucleotide sequence may further comprise an 
■ : !jlO enhancer element or a promoter located 3' or 5' to and controlling the expression 

IQ of such a therapeutic nucleotide sequence or gene, 

A subject nucleotide sequence consists of a nucleic acid molecule that 
comprises at least 2 different operatively linked DNA segments. The DNA can 
H; be manipulated and amplified by PCR as described herein and by using standard 

H;l 5 techniques, such as those described mMolecular Cloning: A Laboratory Manual, 

w " 2nd Ed, Sambrook et aL, eds., Cold Spring Harbor, New York (1989). 

Typically, to produce a nucleotide sequence of the present invention, the sequence 
encoding the selected polypeptide and the promoter or enhancer are operatively 
linked to a DNA molecule capable of autonomous replication in a cell either in 
20 vivo or in vitro. By operatively linking the enhancer element or promoter and the 

nucleotide sequence to the vector, the attached segments are replicated along with 
the vector sequences. 

Thus, a recombinant DNA molecule (rDNA) of the present invention is a 
hybrid DNA molecule comprising at least 2 nucleotide sequences not normally 
25 found together in nature. In various preferred embodiments, one of the sequences 

is a sequence encoding an Ad-derived polypeptide, protein, or fragment thereof. 
Stated another way, a nucleotide sequence of the present invention is one that 
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encodes an expressible protein, polypeptide or fragment thereof, and it may further 
include an active constitutive or regulatable (e.g, inducible) promoter sequence. 

A nucleotide sequence of the present invention is optimally from about 20 
base pairs to about 40,000 base pairs in length. Preferably the nucleotide 
sequence is from about 50 bp to about 38,000 bp in length. In various 
embodiments, the nucleotide sequence is of sufficient length to encode one or 
more adenovirus proteins or functional polypeptide portions thereof. Since 
individual Ad polypeptides vary in length from about 19 amino acid residues to 
about 967 amino acid residues, corresponding nucleotide sequences will range 
from about 50 bp up to about 3000 bp, depending on the number and size of 
individual polypeptide-encoding sequences that are "replaced" in the viral vectors 
by therapeutic nucleotide sequences of the present invention. 

1. Tripartite Leader (TPL) Nucleic Acid Sequences 

In one aspect of the invention, it has been discovered that expression of 
adenovirus late proteins such as the structural proteins in a packaging cell line 
according to the present invention is substantially improved when the expression 
cassette present on the complementing plasmid or in the packaging cell line's 
genome contains an adenovirus tripartite leader (TPL) nucleic acid sequence. 

Thus, the invention contemplates a nucleic acid molecule comprising a 
TPL nucleotide sequence. Preferably, the TPL nucleotide sequence may be 
operatively linked to an intron containing RNA processing signals (such as for 
example, splice donor or splice acceptor sites) suitable for expression in the 
packaging cell line. Most preferably the intron contains a splice donor site and a 
splice acceptor site. Alternatively, the TPL nucleotide sequence may not comprise 
an intron. 

In one embodiment, a subject nucleic acid molecule of this invention is 
isolated, i.e., separated from the genetic environment from which the component 
sequences were obtained. Thus, molecular cloning of fragments of a gene will 
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produce an isolated nucleic acid, as will the chemical synthesis of an 
oligonucleotide to build a nucleic acid molecule. 

The intron useful in the present invention is any nucleotide sequence which 
functions in the packaging cell line to provide RNA processing signals, as are well 
5 known in the art, including splicing signals. Introns have been well characterized 

from a large number of structural genes, and therefor the invention should not be 
considered as limited. Well characterized and preferred introns include a native 
intron 1 from adenovirus, such as Ad5's TPL intron 1 ; others include the S V40 VP 
intron; the rabbit beta-globin intron, and synthetic intron constructs. See, for 
10 example, Petitclerc et aL, J. BiothechnoL, 40:169, 1995, and Choi et aL, MoL 

] r; Cell. Biol, 77:3070, 1991. 

: : The TPL nucleotide sequence comprises either (a) first and second TPL 

f y exons or (b) first, second and third TPL exons, where each TPL exon in the 

l?\ sequence is selected from the group consisting of complete TPL exon 1, partial 

1 y 15 TPL exon 1, complete TPL exon 2 and complete TPL exon 3. A complete exon 

C3 is one which contains the complete nucleic acid sequence based on the sequence 

found in the wild type viral genome. Preferably the TPL exons are from Ad2, 
;;i Ad3, Ad5, Ad7 and the like, however, they may come from any Ad serotype, as 

O described herein. A preferred partial TPL exon 1 is described in the Examples. 

20 The use of a TPL with a partial exon 1 has been reported (W098/13499). 

The intron and the TPL exons can be operatively linked in a variety of 
configurations to provide a functional TPL nucleotide sequence, although in some 
embodiments of the invention, an intron may not be a part of the construct. For 
example, the intron can be positioned between any of TPL exons 1, 2 or 3, and the 
25 exons can be in any order of first and second, or first/second/third. The intron can 

also be placed preceding the first TPL exon or following the last TPL exon. In a 
preferred embodiment, complete TPL exon 1 is operatively linked to complete 
TPL exon 2 operatively linked to complete TPL exon 3. In a preferred variation, 
adenovirus TPL intron 1 is positioned between complete TPL exon 1 and 
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complete TPL exon 2. It may also be possible to use analogous translational 
regulators from other viral systems such as rabiesvirus. 

A preferred "complete" TPL nucleic acid molecule containing complete 
TPL exons 1 , 2 and 3 with adenovirus intron 1 inserted between exons 1 and 2 has 
5 a nucleotide sequence shown in SEQ ID NO: 32. A preferred "partial" TPL 

nucleic acid molecule containing partial TPL exon 1 and complete TPL exons 2 
and 3 in that order has a nucleotide sequence shown in SEQ ID NO: 26. The 
construction of these preferred TPL nucleotide sequences is described in the 
Examples. 

10 Thus, preferred expression cassettes and complementing plasmids for 

I j expressing adenovirus structural genes, particularly fiber protein, contain an 
3: ; adenovirus TPL nucleotide sequence as described herein. Preferred packaging cell 
i q lines containing the subject nucleic acid molecules also contain a TPL nucleotide 
:; j sequence of the invention. 

Hjl5 2. Complementing Plasmids 

The invention describes in a related embodiment nucleic acid molecules 

I I and nucleotide sequences, typically in the form of DN A plasmid vectors, which are 
capable of expression of an adenovirus structural protein or regulatory protein. 
Because these expression plasmids are used to complement the defective genes of 

20 a recombinant adenovirus vector genome of this invention, the plasmids are 

referred to as complementing or complementation plasmids. 

The complementing plasmid contains an expression cassette, a nucleotide 
sequence capable of expressing a protein product off the gene encoded by the 
nucleotide sequence. Expression cassettes are described in more detail herein, but 

25 typically contain a promoter and a structural gene operatively linked to the 

promoter and whose expression is controlled by the promoter. A preferred 
complementing plasmid further includes a TPL nucleotide sequence described 
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herein to enhance expression of the structural gene product when used in the 
context of adenovirus genome replication and packaging. 

In one embodiment, a complementing plasmid comprises a promoter 
nucleotide sequence operatively linked to a nucleotide sequence encoding an 
5 adenovirus structural polypeptide. The adenovirus structural polypeptide is 

selected from the group consisting of penton base; hexon; fiber; polypeptide Ilia; 
polypeptide V; polypeptide VI; polypeptide VII; polypeptide VIII; and biologically 
active fragments thereof In another variation, a complementing plasmid further 
comprises a nucleotide sequence encoding a first adenovirus regulatory 
10 polypeptide, a nucleotide sequence encoding a second regulatory polypeptide, a 

nucleotide sequence encoding a third regulatory polypeptide; or any combination 
=.S of the foregoing. 

■ .; The present invention also discloses a complementing plasmid comprising 

j; a promoter nucleotide sequence operatively linked to a nucleotide sequence 

f W 5 encoding an adenovirus structural protein, polypeptide or fragment thereof and a 

H; nucleotide sequence encoding an adenovirus regulatory protein, polypeptide or 

H fragment thereof In one variation, the early region polypeptide is E4; in another, 

the plasmid comprises pE4/Hygro. In still another variation, the early region 
r 1 polypeptides are El and E4. 

20 In another aspect of the present invention, the complementing plasmid 

used to transform a cell line of the present invention further comprises a DNA 
sequence encoding an adenovirus regulatory protein, polypeptide or fragment 
thereof In one variation, the regulatory protein is selected from the group 
consisting of El A, E1B, E2A, E2B, E3, E4 and L4 (also referred to as "the 100K 
25 protein"); an exemplary complementing plasmid has the characteristics of is 

pE4/Hygro (for deposit details, see the Examples). In another aspect, the 
complementing plasmid used to transform a cell line of the present invention 
further comprises a DNA sequence encoding two or more of the above mentioned 
adenovirus regulatory proteins, polypeptides or fragments thereof 
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Preferred complementing plasmids include pCLF, pDV60,. pDV61, 
pD V67, pD V69, pDV80, pD V90 and the like plasmids described in the Examples. 
The nucleic acid sequence of particularly preferred complementing plasmids are 
shown in SEQ ID NO: 43 for pDV60, SEQ ID NO: 44 for pDV67 and SEQ ID 
5 NO: 47 for pDV69, SEQ ID NO: 64 for pDV80 and SEQ ID NO: 65 for pDV90. 

3. Nucleic Acid Molecule Synthesis 

A nucleic acid molecule comprising synthetic oligonucleotide sequences 
of the present invention can be prepared using any suitable method, such as, the 
i.2 phosphotriester or phosphodiester methods. See Narang etal, Meth. EnzymoL, 

J;:iO 68:90, (1979); U.S. Patent No. 4,356,270; and Brown et cd. 9 Metk EnzymoL, 

\ * 68: 109, (1979), the disclosures of which are incorporated by reference herein. 

C3 For oligonucleotide sequences in which a family of variants is preferred, 

1" the synthesis of the family members can be conducted simultaneously in a single 

\ A reaction vessel, or can be synthesized independently and later admixed in 

Ml 5 preselected molar ratios. For simultaneous synthesis, the nucleotide residues that 

are conserved at preselected positions of the sequence of the family member can 
be introduced in a chemical synthesis protocol simultaneously to the variants by 
the addition of a single preselected nucleotide precursor to the solid phase 
oligonucleotide reaction admixture when that position number of the 
20 oligonucleotide is being chemically added to the growing oligonucleotide polymer. 

The addition of nucleotide residues to those positions in the sequence that vary 
can be introduced simultaneously by the addition of amounts, preferably equimolar 
amounts, of multiple preselected nucleotide precursors to the solid phase 
oligonucleotide reaction admixture during chemical synthesis. For example, where 
25 all four possible natural nucleotides (A,T,G and C) are to be added at a 

preselected position, their precursors are added to the oligonucleotide synthesis 
reaction at that step to simultaneously form four variants. 
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This manner of simultaneous synthesis of a family of related 
oligonucleotides has been previously described for the preparation of "Degenerate 
Oligonucleotides" by Ausubel et al (Current Protocols in Molecular Biology, 
SuppL 8. p.2. 1 1.7, John Wiley & S ons, Inc., New York ,1991), and can readily 
be applied to the preparation of the therapeutic oligonucleotide compositions 
described herein. 

Nucleotide bases other than the common four nucleotides (A,T,G or C), 
or the RNA equivalent nucleotide uracil (U), can also be used in the present 
invention. For example, it is well known that inosine (I) is capable of hybridizing 
with A, T and G, but not C. Examples of other useful nucleotide analogs are 
known in the art and may be found referred to in 37 C.F.R. §1.822. 

Thus, where all four common nucleotides are to occupy a single position 
of a family of oligonucleotides, that is, where the preselected nucleotide sequence 
is designed to contain oligonucleotides that can hybridize to four sequences that 
vary at one position, several different oligonucleotide structures are contemplated. 
The composition can contain four members, where a preselected position contains 
A,T,G or C. Alternatively, a composition can contain two nucleotide sequence 
members, where a preselected position contains I or C, and has the capacity the 
hybridize at that position to all four possible common nucleotides. Finally, other 
nucleotides may be included at the preselected position that have the capacity to 
hybridize in a non- destabilizing manner with more than one of the common 
nucleotides in a manner similar to inosine. 

Similarly, larger nucleic acid molecules can be constructed in synthetic 
oligonucleotide pieces, and assembled by complementary hybridization and 
ligation, as is well known. 

B. Adenovirus Expression Vector Systems 

One key component of the present invention for producing gene therapy 
reagents comprised of recombinant adenovirus particles is the recombinant 
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adenovirus vector genome which is encapsulated in the virus particle and which 
expresses exogenous genes in a gene therapy setting. Thus, the components of an 
recombinant adenovirus vector genome include the ability to express selected 
adenovirus structural genes, to express a desired exogenous protein, and to 
contain sufficient replication and packaging signals that the genome is packaged 
into a gene delivery vector particle. The preferred replication signal is an 
adenovirus inverted terminal repeat containing an adenovirus origin of replication, 
as is well known and described herein. 

According to the present invention, a preferred recombinant adenovirus 
vector genome is "helper independent" which means the genome can replicate and 
be packaged without the help of a second, complementing helper virus. Instead, 
the complementation is provided by a packaging cell line of the present invention. 
Additional embodiments of the invention, however, are drawn to a vector genome 
referred to as "gutless 11 which is "helper dependent." 

In apreferred embodiment, the adenovirus vector genome does not encode 
a functional adenovirus fiber protein. A non-functional fiber gene refers to a 
deletion, mutation or other modification to the adenovirus fiber gene such that the 
gene does not express any or insufficient adenovirus fiber protein to package a 
fiber- containing adenovirus particle without complementation of the fiber gene by 
a complementing plasmid or packaging cell line. Such a genome is referred to as 
a "fiberless" genome, not to be confused with a fiberless particle. Alternatively, 
a fiber protein may be encoded but is insufficiently expressed to result in a fiber 
containing particle. 

Thus, the invention describes a helper-independent fiberless recombinant 
adenovirus vector genome comprising genes which (a) express all adenovirus 
structural gene products but express insufficient adenovirus fiber protein to 
package a fiber-containing adenovirus particle without complementation of said 
fiber gene, (b) express an exogenous protein, and (c) contains an adenovirus 
packaging signal and inverted terminal repeats containing adenovirus origin of 
replication. 
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The introduction of exogenous DNA into eucaryotic cells has become one 
of the most powerful tools of the molecular biologist. The term "exogenous" 
encompasses any therapeutic composition of this invention which is administered 
by the therapeutic methods of this invention. Thus, "exogenous" may also be 
5 referred to herein as "foreign," "non-native," and the like. The methods of this 

invention preferably require efficient delivery of the DNA into the nucleus of the 
recipient cell and subsequent identification of cells that are expressing the foreign 
DNA. 

The adenovirus vector genome is propagated in the laboratory in the form 
10 of rDNA plasmids containing the genome, and upon introduction into an 

* :! i appropriate host, the viral genetic elements provide for viral genome replication 

and packaging rather than plasmid-based propogation. Exemplary methods for 
r j preparing an Ad-vector genome are described in the Examples. 

fl A widely-used plasmid is pBR322, a vector whose nucleotide sequence 

riJ15 and endonuclease cleavage sites are well known. Various other useful plasmid 

O vectors are described in the Examples that follow. 

[ , A nucleic acid vector of the present invention comprises a nucleic acid 

3 (preferably DNA) molecule capable of autonomous replication in a cell and to 

p which a DNA segment, e.g. y a gene or polynucleotide, can be operatively linked 

20 so as to bring about replication of the attached segment. In the present invention, 

one of the nucleotide segments to be operatively linked to vector sequences 
encodes at least a portion of a therapeutic nucleic acid molecule — in effect, a 
nucleic acid sequence that encodes one or more therapeutic proteins or 
polypeptides, or fragments thereof. 
25 As one of skill in the art will note, in various embodiments of the present 

invention, different "types" of vectors are disclosed. For example, one "type" of 
vector is used to deliver particular nucleotide sequences into a packaging cell line, 
with the intent of having said sequences stably integrate into the cellular genome; 
these "types" of vectors are generally identified herein as complementing plasmids. 
30 A further "type" of vector described herein carries or delivers nucleotide 



-43- 

sequeaces in or into a cell line (e.g., a packaging cell line) for the purpose of 
propagating therapeutic viral vectors of the present invention; hence, these vectors 
are generally referred to herein as delivery plasmids. A third "type" of vector 
described herein is utilized to carry nucleotide sequences encoding therapeutic 
proteins or polypeptides to specific cells or cell types in a subject in need of 
treatment; these vectors are generally identified herein as therapeutic viral vectors 
or Ad-derived vectors and are in the form of a virus particle encapsulating a viral 
nucleic acid containing an expression cassette nucleic acid sequence for expressing 
the therapeutic gene. 

1. Nucleic Acid Gene Expression Cassettes 

In various embodiments, a peptide-coding sequence of the therapeutic 
gene is inserted into an expression vector and expressed; however, it is also 
feasible to construct an expression vector which also includes some non-coding 
sequences as well. Preferably, however, non-coding sequences are excluded. 
Alternatively, a nucleotide sequence for a soluble form of a polypeptide may be 
utilized. Another preferred therapeutic viral vector includes a nucleotide sequence 
encoding at least a portion of a therapeutic nucleotide sequence operatively linked 
to the expression vector for expression of the coding sequence in the therapeutic 
nucleotide sequence. 

As used herein with regard to DNA sequences or segments, the phrase 
"operatively linked" generally means the sequences or segments have been 
covalently joined into one piece of DNA, whether in single or double stranded 
form. 

The choice of viral vector into which a therapeutic nucleotide sequence of 
this invention is operatively linked depends directly, as is well known in the art, 
on the functional properties desired, e.g., vector replication and protein 
expression, and the host cell to be transformed - these being limitations inherent 
in the art of constructing recombinant DNA molecules. Although certain 
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adenovirus serotypes are recited herein in the form of specific examples, it should 
be understood that the present invention contemplates the use of any adenovirus 
serotype, including hybrids and derivatives thereof As one will observe, it is not 
unusual or outside the scope of the present invention to utilize nucleotide and/or 
amino acid residue sequences of two or more serotypes in constructs, 
compositions and methods of the invention. 

A translatable nucleotide sequence is a linear series of nucleotides that 
provide an uninterrupted series of at least 8 codons that encode a polypeptide in 
one reading frame. Preferably, the nucleotide sequence is a DNA sequence. The 
vector itself may be of any suitable type, such as a viral vector (RNA or DNA), 
naked straight-chain or circular DNA, or a vesicle or envelope containing the 
nucleic acid material and any polypeptides that are to be inserted into the cell. 

2. Promoters 

As noted elsewhere herein, an expression nucleic acid in an Ad-derived 
vector of the present invention may also include a promoter sequence. 

In general, promoters are DNA segments that contain a DNA sequence 
that controls the expression of a gene located 3* or downstream of the promoter. 
The promoter is the DNA sequence to which RNA polymerase specifically binds 
and initiates RNA synthesis (transcription) of that gene, typically located 3 f of the 
promoter. A promoter also includes DNA sequences which direct the initiation of 
transcription, including those to which RNA polymerase specifically binds. If more 
than one nucleic acid sequence encoding a particular polypeptide or protein is 
included in a therapeutic viral vector or nucleotide sequence, more than one 
promoter or enhancer element may be included, particularly if that would enhance 
efficiency of expression. For purposes of the present invention, regulatable 
(inducible) as well as constitutive promoters may be used, either on separate 
vectors or on the same vector. 
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Both constitutive and regulatable (often called "inducible") promoters are 
useful in constructs and methods of the present invention. For example, some 
useful regulatable promoters are those of the CREB -regulated gene family and 
include inhibin, gonadotropin, cytochrome c, glucagon, and the like. (See, e.g., 
published International App. No. WO96/14061, the disclosure of which is 
incorporated by reference herein.) 

A regulatable or inducible promoter may be described as a promoter 
wherein the rate of RNA polymerase binding and initiation is modulated by 
external stimuli. (See U.S. Patent Nos. 5,750,396 and 5,998,205). Such stimuli 
include various compounds or compositions, light, heat, stress, chemical energy 
sources, and the like. Inducible, suppressible and repressible promoters are 
considered regulatable promoters. 

Regulatable promoters may also include tissue-specific promoters. Tissue- 
specific promoters direct the expression of the gene to which they are operably 
linked to a specific cell type. Tissue-specific promoters cause the gene located 3 1 
of it to be expressed predominantly, if not exclusively, in the specific cells where 
the promoter expressed its endogenous gene. Typically, it appears that if a 
tissue- specific promoter expresses the gene located 3' of it at all, then it is 
expressed appropriately in the correct cell types (see, e.g., Palmiter et aL, Ann. 
Rev. Genet. 20: 465-499, 1986). 

When a tissue-specific promoter controls the expression of a gene, that 
gene will be expressed in a small number of tissues or cell types rather than in 
substantially all tissues and cell types. Examples of tissue-specific promoters 
include the immunoglobulin promoter described by Brinster et aL, Nature 306: 
332-336 (1983) and Storb et aL, Nature 310: 238-231 (1984); the elastase-I 
promoter described by Swift et aL, Cell 38: 639-646 (1984); the globin promoter 
described by Townes et aL, MoL Cell. Biol. 5: 1977-1983 (1985), and Magram 
et aL, MoL Cell. Biol. 9: 4581-4584 (1989), the insulin promoter described by 
Bucchini etaL, PNAS USA, 83: 2511-2515 (1986) and Edwards etal, Cell 58: 
161 (1989); the immunoglobulin promoter described by Ruscon et aL, Nature 
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314: 330-334 (1985) and Grosscheld etaL, Cell 38; 647-658 (1984); the alpha 
actin promoter described by Shani, Mol Cell Biol 6: 2624-2631 (1986); the 
alpha crystalline promoter described by Overbeek et al, PNAS USA 82: 
7815-7819 (1985); the prolactin promoter described by Crenshaw et al, Genes 
5 and Development 3: 959-972 (1989); the propiomelanocortin promoter described 

by Tremblay etaL, PNAS USA 85: 8890-8894 (1988); the beta-thyroid stimulating 
hormone (BTSH) promoter described by Tatsumi et al, Nippon Rinsho 47: 
2213-2220 (1989); the mouse mammary tumor virus (MMTV) promoter 
p described by Muller et al y Cell 54: 105 (1988); the albumin promoter described 

" :il0 by Palmiter et al, Ann, Rev. Genet 20: 465-499 (1986); the keratin promoter 

£0 described by Vassar et al, PNAS USA 86: 8565-8569 (1989); the osteonectin 

j i promoter described by McVey et al., J. Biol Chem. 263: 11,111-11,116(1988); 

I:] the prostate-specific promoter described by Allison et al, Mol Cell Biol 9: 

J; 2254-2257 (1989); the opsin promoter described by Nathans et al, PNAS USA 

1^15 81: 4851-4855 (1984); the olfactory marker protein promoter described by 

Danciger et al, PNAS USA 86: 8565-8569 (1989); the neuron- specific enolase 
(NSE) promoter described by Forss-Pelter et al, J. Neurosci. Res. 16: 141-151 

(1986) ; the L-7 promoter described by Sutcliffe, Trends in Genetics 3: 73-76 

(1987) and the protamine 1 promoter described Peschon et al, Ann. New York 
20 Acad. Set 564: 186-197 (1989) and Braun et al, Genes and Development 3: 

793-802 (1989). (The disclosures of all references cited are incorporated by 
reference herein.) 

3. Adenovirus Vectors 

Although adenovirus consists of many proteins, not all adenovirus proteins 
25 are required for assembly of a recombinant adenovirus particle (vector) of this 

invention. Thus, deletion of the appropriate genes from a recombinant Ad vector 
as taught herein will thus allow the vector to accommodate even larger "foreign" 
DNA segments. Thus, if the sequences encoding one or more adenovirus 



.47- 

polypeptides or proteins are supplanted by a recombinant nucleotide sequence of 
the present invention, the length of the recombinant sequence can conceivably 
extend nearly to the packaging limit of the relevant adenovirus-derived vector. 

In view of the fact that preferred embodiments disclosed herein are helper- 
independent Ad-derived vectors, the entire wild-type Ad genome cannot be 
completely supplanted by recombinant nucleic acid molecules without 
transforming such a vector into a vector requiring "help" of some kind. However, 
most of the Ad-derived vectors of the present invention do not depend on a helper 
virus; instead, the vectors of the present invention are propagated in cell lines 
stably expressing proteins or polypeptides that have been removed from said 
vectors to allow the addition of "foreign" DNA into the vectors. In various 
disclosed embodiments, specific early region and structural polypeptides are 
deleted from the vectors of the present invention, thereby enabling the vectors to 
accommodate recombinant nucleic acid sequences (or cassettes) of various 
lengths. For example, Ad-derived vectors of the present invention may easily 
include 12 kb or more of foreign (or "therapeutic") DNA sequences. 

Thus, adenovirus viral vectors are also disclosed which comprise 
nucleotide sequences encoding a packaging signal and a foreign protein or 
polypeptide, wherein the nucleotide sequence encoding an adenovirus structural 
protein has been deleted. 

In one variation, the nucleotide sequence encoding the foreign protein or 
polypeptide is a DNA molecule up to about 3 kb in length; in another, the 
nucleotide sequence encoding the foreign protein or polypeptide is a DNA 
molecule up to about 9.5 kb in length; in still another, the nucleotide sequence 
encoding the foreign protein or polypeptide is a DNA molecule up to about 12.5 
kb in length. Nucleotide sequences of intermediate lengths are also contemplated 
by the present invention, as are sequences in excess of 12.5 kb. 

The invention also discloses viral vectors wherein the sequence encoding 
a foreign protein or polypeptide is a sequence encoding an anti-tumor agent, a 
tumor suppressor protein, a suicide protein, or a fragment or functional equivalent 
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thereof. In one variation, nucleotide sequences encoding one or more regulatory 
proteins have also been deleted from the vector. In another, the regulatory 
proteins are selected from the group consisting of El A, E1B, E2A, E2B, E3, E4, 
and L4 (100K protein). 

A wide variety of therapeutic viral vectors are also embodiments of the 
present invention. In one embodiment, a therapeutic viral vector is disclosed 
which lacks a DNA sequence encoding fiber protein, or a portion thereof In 
another variation, a therapeutic viral vector may further or alternatively comprise 
deletion of a DNA sequence encoding one or more regulatory proteins, 
polypeptides, or fragments thereof. In various embodiments, foreign DNA 
sequences are inserted in place of the DNA sequence encoding fiber protein in the 
viral vectors of the present invention. In other embodiments, the therapeutic viral 
vectors further comprise foreign DNA sequences inserted in place of the DNA 
sequences encoding one or more regulatory proteins, polypeptides, or fragments 
thereof, and/or one or more structural proteins, polypeptides, or fragments 
thereof. 

The present invention further discloses a number of viral vectors. In one 
variation, a viral vector comprises a deletion or mutation of a DNA sequence 
encoding an adenovirus structural protein, polypeptide, or fragment thereof A 
vector may further comprise deletion or mutation of the DNA sequences encoding 
regulatory polypeptides El A and E1B; and it may still further comprise deletion 
or mutation of the DNA sequence encoding one or more of the following 
regulatory proteins or polypeptides: E2A, E2B, E3, E4, L4, or fragments thereof. 
In another variation, in a viral vector of the present invention, the structural 
protein comprises fiber. Any combination of the foregoing is also contemplated 
by the present invention. The viral vectors of the present invention are suitable for 
the preparation of pharmaceutical compositions comprising any of the therapeutic 
viral vectors disclosed herein — including combinations thereof ~ are also 
disclosed herein. A further use of the viral vectors of the present invention is for 
targeting specific cells in a cell population comprising different cell types. 
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Yet another variation discloses that a foreign DNA sequence encoding one 
or more foreign proteins, polypeptides or fragments thereof has been inserted in 
place of any of the deletions in the therapeutic viral vector. In one embodiment, 
the foreign DNA encodes a tumor-suppressor protein or a biologically active 
fragment thereof. In another embodiment, the foreign DNA encodes a suicide 
protein or a biologically active fragment thereof. 

The invention further contemplates that a viral vector comprises a foreign 
DNA sequence encoding one or more foreign proteins, polypeptides or fragments 
thereof wherein said DNA sequence has been inserted in place of any structural 
and/or regulatory proteins (or portions thereof) that have been deleted. Thus, in 
one embodiment, the foreign DNA encodes a therapeutic molecule such as a 
tumor- suppressor protein; a suicide protein; a cystic fibrosis transmembrane 
conductance regulator (CFTR) protein; or a biologically active fragment of any 
of them. 

The therapeutic (or foreign) nucleotide sequence can be a gene or gene 
fragment that encodes a protein or polypeptide — or a biologically active fragment 
thereof (See, e.g., Crystal, et al, Nature Genetics 8: 42-51 (1994); Zabner, et 
al, Cell 75: 207-216 (1993); Knowles, et al, NEJM 333(13) : 823-831 (1995); 
and Rosenfeld, et al, Cell 68: 143-155 (1992), the disclosures of which are 
incorporated by reference herein.) 

It is further contemplated that a therapeutic protein or polypeptide 
expressed by a therapeutic viral vector of the present invention may be used in 
conjunction with another therapeutic agent when appropriate — e.g. 7 a thymidine 
kinase metabolite may be used in conjunction with the gene encoding thymidine 
kinase and its gene product — in order to be even more effective. 

Alternatively, a therapeutic viral vector can include a DNA or RNA 
oligonucleotide sequence that exhibits therapeutic activity without needing to be 
translated into a polypeptide product before exerting a therapeutic effect. 
Examples of the latter include antisense oligonucleotides that will inhibit the 
transcription of deleterious genes or ribozymes that act as site-specific 
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ribonucleases for cleaving selected mutated gene sequences. In another variation, 
a therapeutic nucleotide sequence of the present invention may comprise a DNA 
construct capable of generating therapeutic nucleotide molecules, including 
ribozymes and antisense DNA, in high copy numbers in target cells, as described 
in published PCT application No. WO 92/06693 (the disclosure of which is 
incorporated herein by reference). Other preferred therapeutic nucleotide 
sequences according to the present invention are capable of delivering HIV 
antisense oligonucleotides to latently-infected T cells via CD4. Similarly, delivery 
of Epstein-Barr Virus (EBV) EBNa-1 antisense oligonucleotides to B cells via 
CR2 is capable of effecting therapeutic results. 

A preferred recombinant adenovirus vector genome is based on the vector 
described in the Examples and designated Ad5.Bgal.AF. This vector is a helper 
independent, fiberless vector genome which can host, upon insertion, an 
exogenous gene for expression of an exogenous or therapeutic protein. The 
genome of Ad5.Bgal.AF has a nucleotide sequence shown in SEQ ID NO: 27. 
A virus particle containing Ad5.Bgal.AF vector genome has been prepared as 
described in the Examples and is deposited with the ATCC as Accession No. VR- 
2636 

The Ad5 .Bgal AF genome nucleic acid can be manipulated to contain any 
exogenous gene in place of the beta-galactosidase gene present in the construct, 
as described herein. 

V. Construction of Therapeutic Viral Vectors for Gene Delivery 
A. Adenovirus Particles 

Various methods of making and using the vectors, plasmids, cell lines and 
other compositions and constructs of the present invention are also disclosed 
herein. The following methods are considered exemplary and not limiting. 
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Thus, in one variation, the invention discloses a method of constructing 
therapeutic viral vectors, comprising introducing a delivery plasmid into an Ad 
fiber-expressing complementing cell line, wherein the DNA sequence encoding Ad 
fiber protein has been deleted from the delivery plasmid. In one variation, the 
5 delivery plasmid further includes a DNA sequence encoding a foreign protein, 

polypeptide, or fragment thereof In other embodiments, a combination of pDV44 
and pDElBPgal or a similar construct such as, for example, that found in pDV44, 
pAElBPgal or the equivalent. 

A recombinant adenovirus particle may be produced with a fiber protein, 
10 or it may be produced without a fiber protein ("fiberless particle") according to 

^ the present invention. Where the particle is made without fiber, such as by 

passaging the fiberless viral vector genome, e.g., Ad5.Bgal.AF in the 293 cells, a 
f y fiberless genome is replicated and packaged in a fiberless particle. In contrast, 

Iq where the fiberless genome Ad5.Bgal.AF is passaged in the 21 IB or other fiber 

* y 15 expressing cells, a fiberless genome is replicated and packaged into a fiber- 

O containing particle. 

iu Recombinant adenovirus particles may be made such that they include no 

::!: fiber proteins, modified fiber proteins or other exogenous proteins. They may also 

□ be produced in systems using either helper-independent or helper-dependent 

20 adenovirus recombinant genomes, i.e. with or without helper viruses. 

B* Targeting of Particles to Tissues - Virus Trophism 

A preferred viral vector particle in which therapeutic nucleotide 
compositions of this invention are present is derived from adenovirus (Ad). As 
taught herein, viral vector particles of this invention may be designed and 
25 constructed in such a way that they specifically target a preselected recipient cell 

type, depending on the nature of therapy one seeks to administer. Methods of 
making and using therapeutic viral vectors that target specific cells are further 
described in the Examples that follow. 
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Novel vectors, viral particles or compositions may also be designed and 
prepared to preferentially target cells that might not otherwise be targeted by wild- 
type adenovirus virions. For example, in order to target non-epithelial cells, one 
following the teachings of the present specification may be able to prepare a 
therapeutic vector particle including a nucleotide sequence encoding a foreign 
protein, polypeptide or other ligand directed to a non-epithelial cell or to a 
different receptor than that generally targeted by a particular adenovirus. 
Examples of useful ligands directed to specific receptors (identified in parentheses) 
include the V3 loop of HIV gpl20 (CD4); transferrin (transferrin receptor), LDL, 
apolipoprotein B100, apolipoprotein E (LDL receptors); and deglycosylated 
proteins (asialoglycoprotein receptor). Various useful ligands which may be 
added to adenovirus fiber — and methods for preparing and attaching same ~ are 
set forth in U.S. Patent Nos. 5,756,086 and 5,543,328, the disclosures of which 
are incorporated by reference herein. 

In yet another embodiment, the non-native amino acid residue sequence 
is incorporated into the fiber amino acid residue sequence at a location other than 
one of the fiber termini. Alternatively, the non-native amino acid residue sequence 
alters the binding specificity of the fiber for a targeted cell type. In other 
embodiments, the linker sequence alters the binding specificity of the fiber for a 
targeted cell type. The expressed fiber may, in various embodiments, bind to a 
specific targeted cell type not usually targeted by adenovirus and/or may comprise 
amino acid residue sequences from more than one adenovirus serotype. 

Useful ligands may be encoded by a foreign nucleotide sequence contained 
within a viral vector of the present invention, or may be linked to proteins or 
polypeptides, include antibodies and attachment sequences, as well as receptors 
themselves. For example, antibodies to cell receptor molecules such as integrins 
and the like, MHC Class I and Class II, asialoglycoprotein receptor, transferrin 
receptors, LDL receptors, CD4, and CR2 are but a few which are useful according 
to the present invention. It is also understood that the ligands typically bound by 
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receptors, as well as analogs to those ligands, may be used as cellular targeting 
agents, as disclosed herein. 

VL Therapeutic Methods 

The recombinant adenovirus vectors of the present invention, typically in 
5 the form of an adenovirus particle encapsulating a recombinant adenovirus vector 

genome containing an expression cassette for expressing a therapeutic gene, are 
particularly suited for gene therapy. Thus, various therapeutic methods are 
^ contemplated by the present invention. 

For example, it has now been discovered that Ad-derived viral vectors are 
ml 0 capable of delivering a therapeutic nucleotide sequence to a specific cell or tissue, 

''ifi based on the tissue tropism of the particle, thereby expanding and enhancing 

J;;- treatment options available in numerous conditions in which more conventional 

£ therapies are of limited efficacy. Accordingly, methods of gene therapy utilizing 

il a recombinant adenovirus particle containing a modified fiber or chimeric fiber 

r !15 which targets a preselected tissue, as described herein, is within the scope of the 

C 3 invention. Vector particles are typically purified and then an effective amount is 

administered in vivo or ex vivo (in vitrojmto the subject. 

For in vitro or ex vivo gene transfer, administration is often accomplished 
by first isolating a selected cell population from a patient such as lung epithelial 
20 cells, lymphocytes and the like followed by in vitro or ex vivo gene transfer of the 

therapeutic compositions of this invention and the replacement of the cells into the 
patient. In vivo therapy is also contemplated, e.g., via the administration of 
therapeutic compositions of this invention by various delivery means. For 
example, aerosol administration and administration via subcutaneous, intravenous, 
25 intraperitoneal, intramuscular, ocular means and the like are also within the scope 

of the present invention. 

Other gene-delivery methods are also useful in conjunction with the 
methods, compositions and constructs of the present invention; see, e.g., published 
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International Application No. WO 95/11984, the disclosures of which are 
incorporated by reference herein. 

The present invention also contemplates various methods of targeting 
specific cells — e.g., cells in a subject in need of diagnosis and/or treatment. As 
discussed herein, the present invention contemplates that the viral vectors and 
compositions of the present invention may be directed to specific receptors or 
cells, for the ultimate purpose of delivering those vectors and compositions to 
specific cells or cell types. The viral vectors and constructs of the present 
invention are particularly useful in this regard. 

In general, adenovirus attachment and uptake into cells are separate but 
cooperative events that result from the interaction of distinct viral coat proteins 
with a receptor for attachment and a v integrin receptors for internalization. 
Adenovirus attachment to the cell surface via the fiber coat proteins has been 
discovered to be dissociable and distinct from the subsequent step of 
internalization, and the present invention is able to take advantage of and function 
in conjunction with these differing receptors. 

The invention also discloses methods of transforming a pathologic 
hyperproliferative mammalian cell comprising contacting the cell with any of the 
vectors described herein. In another embodiment, methods of infecting a 
mammalian target cell with a viral vector containing a preselected foreign 
nucleotide sequence are disclosed. One such variation comprises the following 
steps: (a) infecting the target cell with a viral vector of the present invention, the 
viral vector carrying a preselected foreign nucleotide sequence; and (b) expressing 
the foreign nucleotide sequence in the targeted cell. 

The invention also encompasses mammalian target cells infected with a 
preselected foreign nucleotide sequence produced by the methods disclosed 
herein. In one variation, the target cells are selected from the group consisting of 
replicating, slow-replicating and non-replicating human cells. 

Methods of treating an acquired or hereditary disease are also disclosed. 
One method comprises (a) administering a pharmaceutically acceptable dose of a 



-55- 

viral vector to a target cell, wherein the vector comprises a preselected therapeutic 
nucleotide sequence; and (b) expressing the therapeutic sequence in the target cell 
for a time period sufficient to ameliorate the acquired or hereditary disease in the 
cell. Method of gene therapy comprising administering to a subject an effective 
amount of a therapeutic viral vector produced by a packaging cell line of the 
present invention are also disclosed. 

Also contemplated by the present invention are various methods of 
inhibiting the proliferation of a tumor in a subject comprising administering an 
effective amount of a therapeutic viral vector of the present invention under 
suitable conditions to the subject. In one variation, the gene encodes an anti- 
tumor agent. In another variation, the agent is a tumor- suppressor gene. In still 
another embodiment, the agent is a suicide gene or a functional equivalent thereof. 
In another variation, the vector is administered via intra-tumoral injection. 

A composition of this invention may be used prophylactically or 
therapeutically in vivo to disrupt HIV infection and mechanisms of action by 
inhibiting gene expression or activation, via delivery of antisense HIV sequences 
or ribozymes to T cells or monocytes. Using methods of the present invention, 
one may target therapeutic viral vectors as disclosed herein to specific cells and 
tissues, including hematopoietic cells, as infection of such cells appears to be 
mediated by distinct integrins to which viral vectors of the present invention may 
readily be targeted. (See, e.g., Huang, et al. f J. Virol 70: 4502-8, 1996). 

Other useful therapeutic nucleotide sequences include antisense nucleotide 
sequences complementary to EBV EBNa-1 gene. Use of such therapeutic 
sequences may remediate or prevent latent infection of B cells with EBV. As 
discussed herein and in the Examples below, targeting and delivery may be 
accomplished via the use of various ligands, receptors, and other appropriate 
targeting agents. 

Thus, in one embodiment, a therapeutic method of the present invention 
comprises contacting the cells of a subject infected with EBV or HIV with a 
therapeutically effective amount of a pharmaceutically acceptable composition 
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comprising a therapeutic nucleotide sequence of this invention. In a related 
embodiment, the contacting involves introducing the therapeutic nucleotide 
sequence composition into cells having an EBV or HIV-mediated infection. 

Methods of gene therapy are well known in the art (see, e.g., Larrick and 
Burck, Gene Therapy: Application of Molecular Biology, Elsevier Science Publ 
Co., Inc., New York, NY (1991); Kriegler, Gene Transfer and Expression: A 
Laboratory Manual, W. H. Freeman and Company, New York, 1990). The term 
"subject" should be understood to include any animal — particularly mammalian — 
patient, such as any murine, rat, bovine, porcine, canine, feline, equine, ursine, or 
human patient. 

When the foreign gene carried in the vector encodes a tumor suppressor 
gene or another anti-tumor protein, the vector is useful to treat or reduce 
hyperproliferative cells in a subject, to inhibit tumor proliferation in a subject or 
to ameliorate a particular, related pathology. 

The present invention also contemplates methods of depleting suitable 
samples of pathologic mammalian hyperproliferative cells contaminating 
hematopoietic precursors during bone marrow reconstitution via the introduction 
of a wild-type tumor suppressor gene into the cell preparation using a vector of 
this invention. As used herein, a suitable sample is defined as a heterogeneous cell 
preparation obtained from a patient, e.g., a mixed population of cells containing 
both phenotypically normal and pathogenic cells. 

Administration includes — but is not limited to ~ the introduction of 
therapeutic agents of the present invention into a cell or subject via various means, 
including direct injection, intravenously, intraperitoneally, via intra-tumor 
injection, via aerosols, or topically. Therapeutic agents as disclosed herein may 
also be combined for administration of an effective amount of the agents with a 
pharmaceutically-acceptable carrier, as described herein. 

As used herein, "effective amount" generally means the amount of vector 
particle (or proteins produced/released thereby) which achieves a positive 
outcome in the subject to whom the vector is administered. The total volume 
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administered will necessarily vary depending on the mode of administration, as 
those of skill in the relevant art will appreciate, and dosages may vary as well. 

The dose of a biologic vector (particle) is somewhat complex and may be 
described in terms of the concentration (in plaque-forming units per milliliter 
(pfu/ml)), the total dose (in pfus), or the estimated number of particles 
administered per cell (the estimated multiplicity of infection or MOI). Thus, if a 
vector is administered via infusion — say, across nasal epithelium ~ at a constant 
total volume, the respective concentration, etc. may be described as follows: 

In general, when recombinant adenoviral vector particles are administered 
via infusion across the nasal epithelium (e.g. an area of nasal epithelium containing 
2 x 10 7 cells,) administered amounts producing an estimated MOI (multiplicity of 
infection) of about 10 or greater are much more effective than lower 
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dosages. (See, e.g., Knowles, et al t New Eng. J. Med. 333: 823-831, 1995). 
Similarly, when direct injection is the preferred treatment modality - e.g., direct 
injection of a viral vector into a tumor — doses of 1 x 10 9 pfu or greater are 
generally preferred. (See, e.g., published International App. No. W095/1 1984.) 

Thus, depending on the mode of administration, an effective amount 
administered in a single dose preferably contains from about 10 6 to about 10 15 
infectious units. A typical course of treatment would be one such dose per day 
over a period of five days. As those of skill in the art will appreciate, an effective 
amount may vary depending on ( 1 ) the pathology or other condition to be treated, 
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(2) the status and sensitivity of the patient, and (3) various other factors well 
known to those of skill in the art, such as the patient's tolerance to other courses 
of treatment that may have been applied previously. Thus, those of skill in the art 
may easily and precisely determine effective amounts of the agents/vectors of the 
present invention which may be administered to a particular patient, based on their 
understanding of and evaluation of such factors. 

The present invention also contemplates methods of ameliorating 
pathologies characterized by hyperproliferative cells or genetic defects in a subject, 
by administering to the subject an effective amount of a vector as described herein. 
Such vectors preferably contain a foreign gene encoding a gene product {e.g. 
polypeptide or protein) having the ability to ameliorate the pathology, under 
suitable conditions. As used herein, the term "genetic defect" means any disease, 
condition or abnormality which results from inherited factors, e.g. Huntington's 
Disease, Tay-Sachs Disease, or Sickle Cell Disease. 

The present invention further provides methods for reducing the 
proliferation of tumor cells in a subject by introducing into the tumor mass an 
effective amount of an adenoviral expression vector containing an anti-tumor gene 
other than a tumor suppressor gene. The anti-tumor gene can encode, for 
example, thymidine kinase (TK). An effective amount of a therapeutic agent is 
then administered to the subject; the therapeutic agent, in the presence of the anti- 
tumor gene, is toxic to the cell. 

Using thymidine kinase as exemplary, the therapeutic agent is a thymidine 
kinase metabolite such as ganciclovir (GCV), 6-methoxypurine arabinonucleoside 
(araM), or a functional equivalent thereof. Both the thymidine kinase gene and the 
thymidine kinase substrate must be used concurrently in order to exert a toxic 
effect on the host cell. In the presence of the TK gene, GCV is phosphorylated 
and becomes a potent inhibitor of DNA synthesis, whereas araM is converted to 
the cytotoxic anabolite araATP. Thus, the precise method of action or synergism 
is not relevant to therapeutic efficacy; what is relevant is the fact that the 
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concurrent use of appropriate genes and therapeutic agents may effectively 
ameliorate a specific disease condition. 

Another useful example contemplates use of a vector of the present 
invention which expresses the enzyme cytosine deaminase. Such a vector could 
5 be used in conjunction with administration of the drug 5-fluorouracil (Austin and 

Huber, Mol Pharm. 43: 380-387, 1993) or the recently-described E. coli Deo 
gene in combination with 6-methyl-purine-2'-deoxyribonucleoside (Sorscher etal , 
Gene Therapy 1: 233-238, 1994). 

As with the use of the tumor suppressor genes described previously, the 

1 0 use of other anti-tumor genes, either alone or in combination with the appropriate 

therapeutic agent, provides a treatment for the uncontrolled cell growth or 
proliferation characteristic of tumors and malignancies. Thus, the present 
invention provides therapies to halt the uncontrolled cellular growth in a patient, 
thereby alleviating the symptoms or the disease or cachexia present in the patient. 

1 5 The effect of this treatment includes, but is not limited to, prolonged survival time 

of the patient, reduction in tumor mass or burden, apoptosis of tumor cells, or the 
reduction in the number of circulating tumor cells. Means of quantifying the 
beneficial effects of this therapy are well known to those of skill in the art. 

The present invention provides a recombinant adenovirus expression 

20 vector characterized by the partial or total deletion of one or more adenoviral 

structural protein genes, such as the gene encoding fiber, which allows the vector 
to accommodate a therapeutic, foreign nucleic acid sequence encoding a 
functional foreign polypeptide, protein, or biologically active fragment thereof 
A therapeutic gene sequence may be introduced into a tumor mass by 

25 combining the adenoviral expression vector with a suitable pharmaceutically 

acceptable carrier. Introduction can be accomplished, for example, via direct 
injection of the recombinant Ad vector into the tumor mass. 

A method of tumor-specific delivery of a tumor-suppressor gene is 
accomplished by contacting target tissue in a subject with an effective amount of 

30 a recombinant Ad-derived vector of this invention. In the case of anti-tumor 
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therapy, the gene is intended to encode an anti-tumor agent, such as a functional 
tumor suppressor gene product or suicide gene product. The term "contacting" 
is intended to encompass any delivery method for the efficient transfer of the 
vector, such as via intra-tumoral injection. 

In another example, adenovirus vectors of the present invention can be 
used to transfer genes to central nervous system (CNS) tumors in vivo. 

The present invention also contemplates methods for determining the 
efficacy of the within-disclosed therapeutic compositions and methods. One such 
method for confirming efficacy utilizes the human/SCID (severe combined 
immunodeficient) mouse model of EBV-induced LPD (lymphoproliferative 
disease) to ascertain whether EBV-antisense therapeutic nucleotide sequences 
block tumor formation. (See, e.g., Pisa, etaL, Blood 79: 173-179 (1992); Rowe, 
et al, Curr. Top. Microbiol. Immunol 166: 325 (1990); and Cannon, et al, J_ 
Clin. Invest, 85: 1333-1337 (1990), the disclosures of which are incorporated by 
reference herein.) 

Finally, the use of Ad vectors of the present invention to prepare 
medicaments for the treatment, therapy and/or diagnosis of various diseases is also 
contemplated by this invention. Moreover, other anti-tumor genes may be used 
in combination with the corresponding therapeutic agent to reduce the 
proliferation of tumor cells. Such other gene-and-therapeutic-agent combinations 
are known to those of skill in the art and may be applied as taught herein. 

A, Therapeutic Compositions 

In various alternative embodiments of the present invention, therapeutic 
sequences and compositions useful for practicing the therapeutic methods 
described herein are contemplated. Therapeutic compositions of the present 
invention may contain a physiologically tolerable carrier together with one or more 
therapeutic nucleotide sequences of this invention, dissolved or dispersed therein 
as an active ingredient. In a preferred embodiment, the composition is not 
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immunogenic or otherwise able to cause undesirable side effects when 
administered to a subject for therapeutic purposes. 

As used herein, the terms "pharmaceutical^ acceptable/' "physiologically 
tolerable" and grammatical variations thereof, as they refer to compositions, 
carriers, diluents and reagents, are used interchangeably and represent that the 
materials are capable of administration to or upon a subject — e.g., a mammal — 
without the production of undesirable physiological effects such as nausea, 
dizziness, gastric upset and the like. 

For example, the present invention comprises therapeutic compositions 
useful in the specific targeting of epithelial or non-epithelial cells as well as in 
delivering a therapeutic nucleotide sequence to those cells. Therapeutic 
compositions designed to preferentially target to epithelial cells may comprise a 
recombinant adeno virus-derived vector particle including a therapeutic nucleotide 
sequence. As described herein, a number of adenovirus-derived moieties are 
described, including particles lacking fiber, particles that contain wild type 
adenovirus fiber, and particles that contain modified or chimeric fiber, each type 
providing a different tissue tropism to the particle. 

The preparation of a pharmacological composition that contains active 
ingredients dissolved or dispersed therein is well understood in the art. Typically 
such compositions are prepared as injectables — either as liquid solutions or 
suspensions — however, solid forms suitable for solution or suspension in liquid 
prior to use can also be prepared. A preparation can also be emulsified, or 
formulated into suppositories, ointments, creams, dermal patches, or the like, 
depending on the desired route of administration. 

Physiologically tolerable carriers are well known in the art. Exemplary of 
liquid carriers are sterile aqueous solutions that contain no materials in addition 
to the active ingredients and water, or contain a buffer such as sodium phosphate 
at physiological pH value, physiological saline or both, such as phosphate-buffered 
saline. Still further, aqueous carriers can contain more than one buffer salt, as well 
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as salts such as sodium and potassium chlorides, dextrose, polyethylene glycol and 
other solutes. 

Liquid compositions can also contain liquid phases in addition to and to 
the exclusion of water. Exemplary of such additional liquid phases are glycerin, 
5 vegetable oils such as cottonseed oil, and water-oil emulsions. 

A therapeutic composition typically contains an amount of a therapeutic 
material, i.e., a nucleotide sequence or adenovirus vector particle of the present 
invention, sufficient to deliver a therapeutically effective amount to the target 
tissue, typically an amount of at least 0.1 weight percent to about 90 weight 
10 percent of therapeutic material per weight of total therapeutic composition. A 

weight percent is a ratio by weight of therapeutic material, e.g., a nucleotide 
sequence, to total composition. Thus, for example, 0.1 weight percent is 0.1 
grams of DNA segment per 100 grams of total composition. 



VII. Other Applications 



1 5 The cell lines, viral vectors and methods of the present invention may also 

be used for purposes other than the direct administration of therapeutic nucleotide 
sequences. In one such application, the production of large quantities of 
biologically active proteins or polypeptides in cells transfected with the within- 
disclosed viral vectors is contemplated herein. For example, human 

20 lymphoblastoid cells may b e transfected with a viral vector of the present invention 

carrying a human hematopoietic growth factor such as the gene for erythropoietin 
(EPO); cells so transfected are thus able to produce biologically active EPO. (See, 
e.g., Lopez et aL, Gene 148: 285-91, 1994). 



25 



Various other applications and uses of the within-described methods, cell 
lines, plasmids, vectors, and compositions of the present invention shall become 
apparent upon closer examination of the Examples that follow. 
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Examples 



The following examples are intended to illustrate, but not limit, the present 
invention. As such, the following description provides details of the manner in 
which particular embodiments of the present invention may be made and used. 
This description, while exemplary of the present invention, is not to be construed 
as specifically limiting the invention. Variations and equivalents, now known or 
later developed, which would be within the understanding and technical 
competence of one skilled in this art are to be considered as falling within the 
scope of this invention. 

Example 1 

Preparation of Adenovirus Packaging Cell Lines 

Cell lines that are commonly used for growing adenovirus are useful as 
host cells for the preparation of adenovirus packaging cell lines. Preferred cells 
include 293 cells, an adenovirus-transformed human embryonic kidney cell line 
obtained from the ATCC, having Accession Number CRL 1573; HeLa, a human 
epithelial carcinoma cell line (ATCC Accession Number CCL-2); A549, a human 
lung carcinoma cell line (ATCC Accession Number CCL 1889); and the like 
epithelial-derived cell lines. As a result of the adenovirus transformation, the 293 
cells contain the El early region regulatory gene. All cells were maintained in 
complete DMEM + 10% fetal calf serum unless otherwise noted. 

The cell lines of this invention allow for the production and propagation 
of novel adenovirus-based gene delivery vectors having deletions in preselected 
gene regions, that are obtained by cellular complementation of adenoviral genes. 
To provide the desired complementation of such deleted adenoviral genomes in 
order to generate a novel viral vector of the present invention, plasmid vectors 
that contain preselected functional units were designed as described herein. Such 
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units include but are not limited to El early region, E4 and the viral fiber gene. 
The preparation of plasmids providing such complementation, thereby being 
"complementary plasmids or constructs/' that are stably inserted into host cell 
chromosomes are described below. 

A. Preparation of an E4-Expressing Plasmid for 
Complementation of E4-Gene-Deleted Adenoviruses 

The viral E4 regulatory region contains a single transcription unit which 
is alternately spliced to produce several different mRNAs. The E4-expressing 
plasmid prepared as described herein and used to transfect the 293 cell line 
contains the entire E4 transcriptional unit as shown in Figure 1 . A DN A fragment 
extending from 175 nucleotides upstream of the E4 transcriptional start site 
including the natural E4 promoter to 153 nucleotides downstream of the E4 
polyadenylation signal including the natural E4 terminator signal, corresponding 
to nucleotides 32667-35780 of the adenovirus type 5 (hereinafter referred to as 
Ad5) genome as described in Chroboczek et al (Virol. 9 186:280-285 (1992), 
GenBank Accession Number M73260), was amplified from Ad5 genomic DNA, 
obtained from the ATCC, via the polymerase chain reaction (PCR). Sequences 
of the primers used were 5 ' CGGTAC ACAGAAIIC AGGAGACAC AACTCC3 ' 
(forward or 5' primer referred to as E4L) (SEQ ID NO: 1) and 
5'GCCT GGATCC GGGAAGTTACGTAACGTGGGAAAAC3 ? (SEQ ID NO: 2) 
(backward or 3' primer referred to as E4R). To facilitate cloning of the PCR 
fragment, these oligonucleotides were designed to create novel sites for the 
restriction enzymes EcoRI and BamHI, respectively, as indicated with underlined 
nucleotides. DNA was amplified via PCR using 30 cycles of 92 C for 1 minute, 
50 C for 1 minute, and 72 C for 3 minutes resulting in amplified full-length E4 
gene products. 

The amplified DNA E4 products were then digested with EcoRI and 
BamHI for cloning into the compatible sites of pBluescript/SK+ by standard 
techniques to create the plasmid pBS/E4. A 2603 base pair (bp) cassette 
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including the herpes simplex virus thymidine kinase promoter, the hygromycin 
resistance gene, and the thymidine kinase polyadenylation signal was excised from 
the plasmid pMEP4 (Invitrogen, San Diego, CA) by digestion with Fspl followed 
by addition of BamHI linkers (5'CGCGGATCCGCG3') (SEQ ID NO: 3) for 
5 subsequent digestion with BamHI to isolate the hygromycin-containing fragment. 

The isolated BamHI-modified fragment was then cloned into the BamHI 
site of pBS/E4 containing the E4 region to create the plasmid pE4/Hygro 
containing 8710 bp (Figure 2). The pE4/Hygro plasmid has been deposited with 
the ATCC as described in Example 3. The complete nucleotide sequence of 
1 0 pE4/Hygro is listed in SEQ ID NO: 4. Position number 1 of the linearized vector 

y; corresponds to approximately the middle portion of the pBS/SK+ backbone as 

}Z shown in Figure 2 as a thin line between the 3' BamHI site in the hygromycin 

insert and the 3' EcoRI site in the E4 insert. The 5' and 3' ends of the E4 gene are 
f;9 located at respective nucleotide positions 3820 and 707 of SEQ ID NO: 4 while 

["15 the 5 1 and 3' ends of the hygromycin insert are located at respective nucleotide 

j J positions 3830 and 6470. In the clone that was selected for use, the E4 and 

jU hygromycin resistance genes were divergently transcribed. 

B. Preparation of a Fiber-Expressing Plasmid for 
Complementation of Fiber-Gene-Deleted Adenoviruses 

20 To prepare a fiber-encoding construct, primers were designed to amplify 

the fiber coding region from Ad5 genomic DNA with the addition of unique 
BamHI and NotI sites at the 5' and 3' ends of the fragment, respectively. The 
Ad5 nucleotide sequence is available with the GenBank Accession Number 
M18369. The 5 1 and 3' primers had the respective nucleotide sequences of 

25 5 ' AT GGGATCC AAGATGAAGCGCGCAAGACCG3 ' (SEQ ID NO: 5) and 

5 ' C ATAACGCGGCCGCTTCTTT ATTCTTGGGC3 5 (SEQ ID NO: 6), where 
the inserted BamHI and NotI sites are indicated by underlining. The 5' primer 
also contained a nucleotide substitution 3 nucleotides 5 ' of the second ATG codon 



-66- 

(C to A) that is the initiation site. The nucleotide substitution was included so as 
to improve the consensus for initiation of fiber protein translation. 

The amplified DNA fragment was inserted into the BamHI and NotI sites 
of pcDNA3 (Invitrogen) to create the plasmid designated pCDNA3/Fiber having 
5 7148 bp, the plasmid map of which is shown in Figure 3. The parent plasmid 

contained the CMV promoter, the bovine growth hormone (BHG) terminator and 
the gene for conferring neomycin resistance. The viral sequence included in this 
construct corresponds to nucleotides 31040-32791 of the Ad5 genome. 

The complete nucleotide sequence of pCDNA3 /Fiber is listed in SEQ ID 
f!B ,10 NO: 7 where the nucleotide position 1 corresponds to approximately the middle 

O of the pcDNA3 vector sequence. The 5' and 3' ends of the fiber gene are located 

at respective nucleotide positions 916 with ATG and 2661 with TAA. 
!=; j= To enhance expression of fiber protein by the constitutive CMV promoter 

^3 provided by the pcDNA vector, a Bglll fragment containing the tripartite leader 

s; 15 (TPL) of adenovirus type 5 was excised from pRD112a (Sheay et ah, 

^ BioTechniques, 15:856-862 (1993) and inserted into the BamHI site of 

' r Z pCDNA3/Fiber to create the plasmid pCLF having 7469 bp, the plasmid map of 

O which is shown in Figure 4. The adenovirus tripartite leader sequence, present at 

the 5' end of all major late adenoviral mRNAs as described by Logan et al , Proc. 
20 Natl Acad. Set, USA, 81:3655-3659(1984)andBerkner,5io7fecAw/qw^, 6:616- 

629 (1988), also referred to as a "partial TPL" since it contains a partial exon 1, 
shows correspondence with the Ad5 leader sequence having three spatially 
separated exons corresponding to nucleotide positions 6081-6089 (the 3' end of 
the first leader segment), 7111-7182 (the entire second leader segment), and 9644- 
25 9845 (the third leader segment and sequence downstream of that segment). The 

corresponding cDNA sequence of the partial tripartite leader sequence present in 
pCLF is listed in SEQ ID NO: 8 bordered by BamHI/Bglll 5' and 3' sites at 
respective nucleotide positions 907-912 to 1228-1233. The nucleotide sequence 
of an isolated partial TPL of the present invention is also listed separately as SEQ 
30 ID NO: 26 with the noted 5' and 3' restriction sites and with the following 
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nucleotide regions identified: 1-6 nt Bglll site; 1-18 nt polylinker; 19-27 nt last 
9 nt of the first leader segment (exon 1); 28-99 nt second leader segment (exon 
2); 100-1 87 nt third leader segment (exon 3); 188-301 nt contains the nt sequence 
immediately following the third leader in the genome with an unknown function; 
5 and 322-327 nt Bglll site. 

The pCLF plasmid has been deposited with the ATCC as described in 
Example 3 . The complete nucleotide sequence of pCLF is listed in SEQ ED NO: 
8 where the nucleotide position 1 corresponds to approximately the middle of the 
pcDNA3 parent vector sequence. The 5 1 and 3 ends of the Ad5 fiber gene are 
10 located at respective nucleotide positions 1237-1239 with ATG and 2980-2982 

with TAA. The rest of the vector construct has been previously described above. 



C, Generation of an Adenovirus Packaging Cell Line Carrying 
Plasmids Encoding Functional E4 and Fiber Proteins 

The 293 cell line was selected for preparing the first adenovirus packaging 
15 line as it already contains the El gene as prepared by Graham et al y J. Gen. 

Virol, 3(5:59-74(1977) and as further characterized by Spector, Virol, 730:533- 
538 (1983). Before electroporation, 293 cells were grown in RPMI medium + 
10% fetal calf serum. Four x 10^ cells were electroporated with 20 |ng each of 
pE4/Hygro DNA and pCLF DN A using a BioRad GenePulser and settings of 300 
20 V, 25 [xF. DNA for electr op oration was prepared using the Qiagen system 

according to the manufacturer's instructions (Bio-Rad, Richmond, CA). 

Following electroporation, cells were split into fresh complete DMEM + 
10% fetal calf serum containing 200 ng/ml Hygromycin B (Sigma, St. Louis, 
MO). 

25 From expanded colonies, genomic DNA was isolated using the 

"MICROTURBOGEN" system (Invitrogen) according to manufacturer's 
instructions. The presence of integrated E4 DNA was assessed by PCR using the 
primer pair E4R and ORF6L (5'TGCTTAAGCGGCCGCGAAGGAGA 
AGTCC3 5 ) (SEQ ID NO: 9), the latter of which is a 5' forward primer near 
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adenovirus 5 open reading frame 6. Refer to Figure 1 for position of the primers 
relative to the E4 genes. 

One clone, designated 211, was selected exhibiting altered growth 
properties relative to that seen in parent cell line 293. The 211 clone contained 
5 the expected product, indicating the presence of inserted DNA corresponding to 

most, if not all, of the E4 fragment contained in the pE4/Hygro plasmid. The 211 
cell line has been deposited with the ATCC as described in Example 3 . This line 
was further evaluated by amplification using the primer pair E4L/E4R described 
above, and a product corresponding to the full-length E4 insert was detected. 

10 Genomic Southern blotting was performed on DNA restricted with EcoRI and 

BamHI. The E4 fragment was then detected at approximately one copy/genome 
compared to standards with the EcoRI/BamHI E4 fragment as cloned into pBS/E4 
for use as a labeled probe with the Genius system according to manufacturer's 
instructions (Boehringer Mannheim, Indianapolis, IN). In DNA from the 21 1 cell 

15 line, the expected labeled internal fragment pE4/Hygro hybridized with the 

isolated E4 sequences. In addition, the probe hybridized to a larger fragment 
which may be the result of a second insertion event (Figure 5). 

Although the 211 cell line was not selected by neomycin resistance, thus 
indicating the absence of fiber gene, to confirm the lack of fiber gene, the 2 1 1 cell 

20 line was analyzed for expression of fiber protein by indirect immunofluorescence 

with an anti-fiber polyclonal antibody and a FITC-labeled anti-rabbit IgG (KPL) 
as secondary. No immunoreactivity was detected. Therefore, to generate 211 
clones containing recombinant fiber genes, the 211 clone was expanded by 
growing in RPMI medium and subjected to additional electroporation with the 

25 fiber-encoding pCLF plasmid as described above. 

Following electroporation, cells were plated in DMEM + 10% fetal calf 
serum and colonies were selected with 200 jig/ml G418 (Gibco, Gaithersburg, 
MD). Positive cell lines remained hygromycin resistant. These candidate sublines 
of 211 were then screened for fiber protein expression by indirect 

30 immunofluorescence as described above. The three sublines screened, 211 A, 
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21 IB and 21 1R, along with a number of other sublines, all exhibited nuclear 
staining qualitatively comparable to the positive control of 293 cells infected with 
AdRSVPgal (1 pfu/cell) and stained 24 hours post-infection. 

Lines positive for nuclear staining in this assay were then subjected to 
5 Western blot analysis under denaturing conditions using the same antibody. 

Several lines in which the antibody detected a protein of the expected molecular 
weight (62 kd for the Ad5 fiber protein) were selected for further study including 
21 1 A, 21 IB and 21 1R. The 21 1 A cell line has been deposited with ATCC as 
described in Example 3. 
10 Immunoprecipitation analysis using soluble nuclear extracts from these 

: S three cell lines and a seminative electrophoresis system demonstrated that the fiber 

7 protein expressed is in the functional trimeric form characteristic of the native fiber 

I II protein as shown in Figure 6. The predicted molecular weight of a trimerized fiber 

J is 1 86 kd. The lane marked 293 lacks fiber while the sublines contain detectable 

1 y 15 fiber. Under denaturing conditions, the trimeric form was destroyed resulting in 

O detectable fiber monomers as shown in Figure 6. Those clones containing 

Ljl endogenousEl, newly expressed recombinant E4 and fiber proteins were selected 

for use in complementing adenovirus gene delivery vectors having the 
O corresponding adenoviral genes deleted as described in Example 2. 

20 D. Preparation of an El-Expressing Plasmid for 

Complementation of El-Gene-Deleted Adenoviruses 

In order to prepare adenoviral packaging cell lines other than those based 
on the El -gene containing 293 cell line as described in Example 1C above, 
plasmid vectors containing El alone or in various combinations with E4 and fiber 
25 genes are constructed as described below. 

The region of the adenovirus genome containing the El a and Elb gene is 
amplified from viral genomic DNA by PCR as previously described. The primers 
used are El L, the 5' or forward primer, and E1R, the 3' or backward primer, 
having the respective nucleotide sequences 5'CCG A GCTAGC 
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GACTGAAAATGAG3 ' (SEQ ID NO: 10) and 5'CCT CTCGAG AGACAGC 
AAGACAC3* (SEQ ID NO: 11). The E1L and E1R primers include the 
respective restriction sites Nhel and Xhol as indicated by the underlines. The sites 
are used to clone the amplified El gene fragment into the Nhel/Xhol sites in 
5 pMAM commercially available from Clontech (Palo Alto, C A) to form the plasmid 

pDEX/El having 1 1 152 bp, the plasmid map of which is shown in Figure 7. 

The complete nucleotide sequence of pDEX/El is listed in SEQ ID NO: 
12 where the nucleotide position 1 corresponds to approximately 1454 nucleotides 
from the 3' end of the pMAM backbone vector sequence. The pDEX/El plasmid 
1; 10 includes nucleotides 552 to 4090 of the adenovirus genome positioned 

downstream (beginning at nucleotide position 1460 and ending at 4998 in the 
pDEX/El plasmid) of the glucocorticoid-inducible mouse mammary tumor virus 
(MMTV) promoter of pMAM. The pMAM vector contains the E. coli gpt gene 
that allows stable transfectants to be isolated using 
15 hypoxanthine/aminopterin/thymidine (HAT) selection. The pMAM backbone 

occupies nucleotide positions 1-1454 and 5005-1 1 152 of SEQ ID NO: 12. 



E. Generation of an Adenovirus Packaging Cell Line Carrying 
Plasmids Encoding Functional El, and Fiber Proteins 

To create separate adenovirus packaging cell lines equivalent to that of the 
20 211 sublines, 21 1 A, 21 IB and 21 1R ; as described in Example 1C, alternative cell 

lines lacking adenoviral genomes are selected for transfection with the plasmid 
constructs as described below. Acceptable host cells include A549, Hela, Vero 
and the like cell lines as described in Example 1. The selected cell line is 
transfected with the separate plasmids, pDEX/Eland pCLF, respectively for 
25 expressing El, and fiber complementary proteins. Following transfection 

procedures as previously described, clones containing stable insertions of the two 
plasmids are isolated by selection with neomycin and HAT. Integration of full- 
length copy of the El gene is assessed by PCR amplification from genomic DNA 
using the primer set E1L/E1R , as described above. Functional insertion of the 
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fiber gene is assayed by staining with the anti-fiber antibody as previously 
described. 

The resultant stably integrated cell line is then used as a packaging cell 
system to complement adenoviral gene delivery vectors having the corresponding 
adenoviral gene deletions as described in Example 2. 

F. Preparation of a Plasmid Containing Two or More Adenoviral 
Genes for Complementing Gene-Deleted Adenoviruses 

The methods described in the preceding Examples rely on the use of two 
plasmids, pE4/Hygro and pCLF, or, pCLF and pDEX/El for generating 
adenoviral cell packaging systems. In alternative embodiments contemplated for 
use with the methods of this invention, complementing plasmids containing two 
or more adenoviral genes for expressing of encoded proteins in various 
combinations are also prepared as described below. The resultant plasmids are 
then used in various cell systems with delivery plasmids having the corresponding 
adenoviral gene deletions. The selection of packaging cell, content of the delivery 
plasmids and content of the complementing plasmids for use in generating 
recombinant adenovirus viral vectors of this invention thus depends on whether 
other adenoviral genes are deleted along with the adenoviral fiber gene, and, if so, 
which ones. 

1. Preparation of a Complementing Plasmid Containing 
Fiber and El Adenoviral Genes 

A DN A fragment containing sequences for the CMV promoter, adenovirus 
tripartite leader, fiber gene and bovine growth hormone terminator is amplified 
from pCLF prepared in Example IB using the forward primer 
5 l GACGGATCGGGAGATCTCC3' (SEQ ID NO: 13), that anneals to the 
nucleotides 1-19 of the pCDNA3 vector backbone in pCLF, and the backward 
primer 5'CCGCCTCAGAAGCCAT AGAGCC3 ' (SEQ ID NO: 14) that anneals 
to nucleotides 1278-1257 of the pCDNA3 vector backbone. The fragment is 
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amplified as previously described and then cloned into the pDEX/El plasmid, 
prepared in Example ID. For cloning in the DNA fragment, the pDEX/El vector 
is first digested with Ndel, that cuts at a unique site in the pMAM vector 
backbone in pDEX/El , then the ends are repaired by treatment with bacteriophage 
5 T4 polymerase and dNTPs. 

The resulting plasmid containing E 1 and fiber genes, designated pE 1 /Fiber, 
provides both dexamethasone-inducible El function as described for DEX/E1 and 
expression of Ad5 fiber protein as described above. A schematic plasmid map of 
pEl/Fiber, having 14455 bp, is shown in Figure 8. 
10 The complete nucleotide sequence of pEl/Fiber is listed in SEQ ED NO: 

%1 15 where the nucleotide position 1 corresponds to approximately to 1459 

nucleotides from the 3' end of the parent vector pMAM sequence. The 5 ! and 3 
J jf ends of the Ad5 El gene are located at respective nucleotide positions 1460 and 

f. : .i 4998 followed by pMAM backbone and then separated from the Ad5 fiber from 

s "l 5 pCLF by the filled-in blunt ended Ndel site. The 5' and 3' ends of the pCLF fiber 

J f gene fragment are located at respective nucleotide positions 10922-14223 

M containing elements as previously described for pCLF. 

O The resultant pEl /Fiber plasmid is then used to complement one or more 

* : - delivery plasmids expressing El and fiber. 

20 The pEl/Fiber construct is then used to transfect a selected host cell as 

described in Example IE to generate stable chromosomal insertions preformed as 
previously described followed by selection on HAT medium. The stable cells are 
then used as packaging cells as described in Example 2. 

25 2. Preparation of a Complementing Plasmid Containing 

E4 and Fiber Adenoviral Genes 

pCLF prepared as described in Example IB is partially digested with Bglll 
to cut only at the site in the pCDNA3 backbone. The pE4/Hygro plasmid 
prepared in Example 1 A is digested with BamHI to produce a fragment containing 
30 E4. The E4 fragment is then inserted into the BamHI site of pCLF to form 
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plasmid pE4/Fiber. The resultant plasmid provides expression of the fiber gene 
as described for pCLF and E4 function as described for pE4/Hygro. 

A schematic plasmid map of pE4/Fiber, having 10610 bp, is shown in 
Figure 9. The complete nucleotide sequence of pE4/Fiber is listed in SEQ ID NO: 
16 where the nucleotide position 1 corresponds to approximately 14 bp from the 
3' end of the parent vector pCDNA3 backbone sequence. The 5' and 3 ends of the 
Ad5 E4 gene are located at respective nucleotide positions 21 and 3 149 followed 
by fused Bglll/BamHI sites and pCDNA3 backbone including the CMV promoter 
again followed by Bglll/BamHI sites. The adenovirus leader sequence begins at 
nucleotide position 4051 and extends to 4366 followed by fused BamHI/Bglll 
sites and the 5' and 3' ends of the fiber gene located at respective nucleotide 
positions 4372 and 6124. 

Stable chromosonal insertions of pE4/Fiber in host cells are obtained as 
described above. 

Example 2 

Preparation of Adenoviral Gene Delivery Vectors 
Using Adenoviral Packaging Cell Lines 

Adenoviral delivery vectors of this invention are prepared to separately 
lack the combinations of El/fiber and E4/fiber. Such vectors are more replication- 
defective than those previously in use due to the absence of multiple viral genes. 
A preferred adenoviral delivery vector of this invention that is replication 
competent but only via a non-fiber means is one that only lacks the fiber gene but 
contains the remaining functional adenoviral regulatory and structural genes. 
Furthermore, the adenovirus delivery vectors of this invention have a higher 
capacity for insertion of foreign DNA. 
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A. Preparation of Adenoviral Gene Delivery Vectors Having 
Specific Gene Deletions and Methods of Use 

To construct the El/ fiber deleted viral vector containing the LacZ 
reporter gene construct, two new plasmids were constructed. The plasmid pA 
5 ElBPgal was constructed as follows. A DNA fragment containing the SV40 

regulatory sequences andE. coli P-galactosidase gene was isolated frompSV(3gal 
(Promega) by digesting with Vspl, filling the overhanging ends by treatment with 
Klenow fragment of DNA polymerase I in the presence of dNTP's and digesting 
with Bam HI. The resulting fragment was cloned into the EcoRV and BamHI 

10 sites in the polylinker of pA ElsplB (Microbix Biosystems, Hamilton, Ontario) 

to form pA E1B Pgal that therefore contained the left end of the adenovirus 
genome with the Ela region replaced by the LacZ cassette (nucleotides 6690 to 
415 1) of pSVp gal. Plasmid DNA may be prepared by the alkaline lysis method 
as described by Birnboim and Doly, Nuc. Acids Res., 7:1513-1523 (1978) or by 

15 the Quiagen method according to the manufacturer's instruction, from 

transformed cells used to expand the plasmid DNA was then purified by CsCl- 
ethidium bromide density gradient centrifiigation. Alternatively, plasmid DNAs 
may be purified from E. coli by standard methods known in the art (e.g. see 
Sambrook et al.) 

20 The second plasmid (pDV44), prepared as described herein, is derived 

from pBHGlO, a vector prepared as described by Bett et al y Proc. Natl. Acad 
ScL f USA, 91:8802-8806 (1994), now described in International Application 
Publication Number WO 9500655, with methodology well known to one of 
ordinary skill in the art and also is commercially available from Microbix, which 

25 contains an Ad5 genome with the packaging signals at the left end deleted and the 

E3 region (nucleotides 28133:30818) replaced by a linker with a unique site for 
the restriction enzyme Pad. An 11.9 kb BamHI fragment, which contains the 
right end of the adenovirus genome, is isolated from pBHGl 0 and cloned into the 
BamHI site of pBS/SK(+) to create plasmid pi 1.3 having approximately 14,658 

30 bp. A schematic of the plasmid map is shown in Figure 13. The pi 1.3 plasmid 
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was then digested with Pad and Sail to remove the fiber, E4, and inverted 
terminal repeat (ITR) sequences. 

This fragment was replaced with a 3,4 kb fragment containing the ITR 
segments and the E4 gene which was generated by PCR amplification from 
5 pBHGlO using the following oligonucleotide sequences (5' TGTACACCG 

GATCCGGCGCACACC3 ' SEQ ID NO: 17) and (5'CACAACGAGCTC 
AATT AATTAATTGCCACATCCTC3 1 SEQ ID NO: 18). These primers 
incorporated sites for Pad and BamHI. Cloning this fragment into the Pad and 
blunt ended Sail sites of the p 1 1 .3 backbone resulted in a substitution of the fused 
10 ITRs, E4 region and fiber gene present in pBHGlO, by the ITRs and E4 region 

alone. The resultant p 1 1 . 3 plasmid containing the ITR and E4 regions, now called 
plasmid pDV43a, was then digested withBamHL This BamHI fragment was then 
used to replace a BamHI fragment in pBHGlO thereby creating pDV44 in a 
pBHGlO backbone. 

15 In an alternative approach to preparing pDV44 with an additional 

subcloning step to facilitate the incorporation of restriction cloning sites, the 
following cloning procedure was performed. pDV44 as above was constructed 
by removing the fiber gene and some of the residual E3 sequences from pBHGl 0 
(Microbix Biosystems). As above, to simplify manipulations, the 1 1.9 kb BamHI 

20 fragment including the rightmost part of the Ad5 genome was removed from 

pBHGlO and inserted into pBS/SK. The resulting plasmid was termed pi 1.3. 
The 3.4 kb DNA fragment corresponding to the E4 region and both ITRs of 
adenovirus type 5 was amplified as described above from pBHGlO using the 
oligonucleotides listed above and subcloned into the vector pCR2. 1 (Invitrogen) 

25 to create pDV42. This step is the additional cloning step to facilitate the 

incorporation of a Sail restriction site. pDV42 was then digested with Pad, which 
cuts at a unique site (bold type) in one of the PCR primers, and with Sail, which 
cuts at a unique site in the pCR2. 1 polylinker. This fragment was used to replace 
the corresponding Pacl/Xhol fragment of pi 1 .3 (the pBS polylinker adjacent to 

30 the Ad DNA fragment contains a unique Xhol site), creating pDV43. Finally, 
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pDV44 was constructed by replacing the 1 1 .9 kb BamHI fragment of pBHGl 0 by 
the analogous BamHI fragment of pDV43. As generated in the first procedure, 
pDV44 therefore differs from pBHGlO by the deletion of Ad5 nucleotides 
30819:32743 (residual E3 sequences and all but the 3' -most 41 nucleotides of the 
5 fiber open reading frame). 

Thus, to summarize, the cloning procedures described above result in the 
production of a fiber-deleted Ad5 genomic plasmid (pD V44) that was constructed 
by removing the fiber gene and some of the residual E3 sequences from pBHGlO 
(Figure 16A). pDV44 contains a wild-type E4 region, but only the last 41 

10 nucleotides of the fiber ORF (this sequence was retained to avoid affecting 

expression of the adjacent E4 transcription unit). Both pBHGlO and pDV44 
contain unpackageable Ad5 genomes, and must be rescued by cotransfection and 
subsequent homologous recombination with DNA carrying functional packaging 
signals. In order to generate vectors marked with a reporter gene, either pDV44 

15 or pBHGl 0 was cotransfected with pAElBBgal, which contains the left end of the 

Ad5 genome with an SV40-driven B-galactosidase reporter gene inserted in place 
of the El region. 

In general, and as described below, the method for virus production by 
recombination of plasmids followed by complementation in cell culture involves 

20 the isolation of recombinant viruses by cotransfection of any one of the adenovirus 

packaging cell systems prepared in Example 1, namely 21 1 A, 21 IB, 21 1R, A549, 
Vero cells, and the like, with plasmids carrying sequences corresponding to viral 
gene delivery vectors. 

A selected cell line is plated in dishes and cotransfected with pD V44 and 

25 pAElBP gal using the calcium phosphate method as described by Bett et al. y 

Proc. Natl Acad Scl, USA, 97:8802-8806 (1994). Recombination between the 
overlapping adenovirus sequences in the two plasmids leads to the creation of a 
full-length viral chromosome where pDV44 and pAElBP gal recombine to form 
a recombinant adenovirus vector having multiple deletions. The deletion of El 

30 and of the fiber gene from the viral chromosome is compensated for by the 
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sequences integrated into the packaging cell genome, and infectious virus particles 
are produced. The plaques thus generated are isolated and stocks of the 
recombinant virus are produced by standard methods. 

A pDV44-derived virus is expected to be replication-defective due to the 
fiber deletion, so that the cells in which it is grown must complement this defect. 
The 21 IB cell line (a derivative of 293 cells which expresses the wild-type (wt) 
AD5 fiber and is equivalent to 211 A on deposit with ATCC as described in 
Example 3) was used for rescue and propagation of the virus described here. 
pDV44 and pAElBgal were cotransfected into 21 IB cells, and the monolayers 
were observed for evidence of cytopathic effect (CPE). Briefly, for virus 
construction, cells were transfected with the indicated plasmids using the Gibco 
Calcium Phosphate Transfection system according to the manufacturer's 
instructions and observed daily for evidence of CPE. 

One of a total of 58 transfected dishes showed evidence of spreading cell 
death at day 1 5 . A crude freeze-thaw lysate was prepared from these cells and the 
resulting virus (termed Ad5.Bgal.AF) was plaque purified twice and then 
expanded. To prepare purified viral preparations, cells were infected with the 
indicated Ad and observed for completion of CPE. Briefly, at day zero, 21 IB 
cells were plated in DMEM plus 10% fetal calf serum at approximately 1 X 10 7 
cells/150 cm 2 flask or equivalent density. At day one, the medium was replaced 
with one half the original volume of fresh DMEM containing the indicated Ad, in 
this case Ad5.Bgal.AF, at approximately 100 particles/cell. At day two, an equal 
volume of medium was added to each flask and the cells were observed for CPE. 
Two to five days after infection, cells were collected and virus isolated by lysis via 
four rapid freeze-thaw cycles. Virus was then purified by centrifugation on 
preformed 1 5-40% CsCl gradients (1 1 1,000 x g for three hours at 4°C). The 
bands were harvested, dialyzed into storage buffer (10 mM Tris-pH 8.1, 0.9% 
NaCl, and 10% glycerol), aliquoted and stored at - 70 °C. Purified Ad5.flgal.AF 
virus particles containing human adenovirus Ad5.6gal.AFgenome (described 
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further below) have been deposited with the ATCC on January 1 5, 1 999 as further 
described in Example 3. 

For viral titering, as necessary in the below Examples, Ad preparations 
were titered by plaque assay on 21 IB cells. Cells were plated on 
5 polylysine-coated 6 well plates at 1 . 5 x 1 0 6 cells/well. Duplicate dilutions of virus 

stock were added to the plates in 1 ml/well of complete DMEM. After a five hour 
incubation at 37 °C, virus was removed and the wells overlaid with 2 ml of 0.6% 
low-melting agarose in Medium 199 (Gibco). An additional 1 ml of overlay was 
added at five day intervals. 

10 As a control, the first-generation virus Ad5 .B gal.wt, which is identical to 

Ad5.BgaLAF except for the fiber deletion, was constructed by cotransfection of 
pBHGlO and pAElBBgal (Figure 16B). In contrast to the low efficiency of 
recovery of the fiberless genome (1/58 dishes), all of 9 dishes cotransfected with 
pAElBPgal and pBHGlO produced virus. 

15 In a preferred embodiment of this invention as more fully described herein 

and below, a delivery plasmid is prepared that does not require the above- 
described recombination events to prepare a viral vector having a fiber gene 
deletion. In one embodiment, a single delivery plasmid containing all the 
adenoviral genome necessary for packaging but lacking the fiber gene is prepared 

20 from plasmid pFG140 containing full-length Ad5 that is commercially available 

from Microbix. The resultant delivery plasmid referred to as pFG140-f is then 
used with pCLF stably integrated cells as described above to prepare a viral vector 
lacking fiber. In a preferred aspect of this invention, the fiber gene is replaced 
with a therapeutic gene of interest for preparing a therapeutic delivery adenoviral 

25 vector. Other embodiments including production of fiberless vector with a 

complete TPL are described in Example 5. 

Vectors for the delivery of any desired gene and preferably a therapeutic 
gene are prepared by cloning the gene of interest into the multiple cloning sites in 
the polylinker of commercially available pAElsplB (Microbix Biosystems), in an 

30 analogous manner as performed for preparing p El Bp gal as described above. 
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The same cotransfection and recombination procedure is then followed as 
described herein to obtain viral gene delivery vectors as further discussed in later 
Examples. 

1. Characterization of the Ad5.PgaI.AF Genome 

To confirm that the vector genomes had the expected structures and that 
the fiber gene was absent from the Ad5.Bgal.AF chromosome, the DNA isolated 
from viral particles was analyzed. Briefly, purified viral DNA was obtained by 
adding 10 nl of 10 mg/ml proteinase K, 40 \il of 0.5 M EDTA and 50 |il of 10% 
SDS to 800 |il of adenovirus-containing culture supernatant. The suspension was 
then incubated at 5 5 ° C for 60 minutes. The solution was then extracted once with 
400 (il of a 24: 1 mixture of chloroform: isoamyl alchohol. The aqueous phase was 
then removed and precipitated with sodium acetate/ethanol. The pellet was 
washed once with 70% ethanol and lightly dried. The pellet was then suspended 
in 40 pi of 10 mM Tris-HCl, pH 8.0, 1 mM EDTA. Genomic DNA from both 
Ad5.Bgal.wt and Ad5.Bgal.AF produced the expected restriction patterns (Figure 
17A) following digestion with either EcoRI (Figure 17B) or with Ndel (data not 
shown). Southern blotting, performed with standard methods, with labeled fiber 
DNA as a probe demonstrated the presence of fiber sequence in Ad5.Bgal.wt but 
not in Ad5.Bgal.AF DNA (Figure 17C). As a positive control, the blot was 
stripped and reprobed with labeled E4 sequence. Fiber and E4 sequences were 
detected by using labeled inserts from pCLF and pE4/Hygro, respectively. As 
expected, E4 signal was readily detectable in both genomes at equal intensities 
(Figure 17C). 

The complete nucleotide sequence of Ad5.J3gal.AF is presented in SEQ ID 
NO: 27 and is contained in the virus particle on deposit with ATCC. 
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2. Characterization of the Fiberless Adenovirus 
Ad5.Pgal.AF 

To verify that Ad5.Bgal.AF was fiber-defective, 293 cells (which are 
permissive for growth of El -deleted Ad vectors but do not express fiber) were 
5 infected with Ad5.Bgal.AF or with Ad5.Bgal.wt Twenty-four hours post 

infection, the cells were stained with polyclonal antibodies directed either against 
fiber or against the penton base protein. Cells infected with either virus were 
stained by the anti-penton base antibody, while only cells infected with the 
Ad5 .Bgal.wt control virus reacted with the anti-fiber antibody. This confirms that 
10 the fiber-deleted Ad mutant does not direct the synthesis of fiber protein. 

3. Growth of the Fiber-Deleted Ad5.Pgal.AF Vector in 
Complementing Cells 

Ad5.Bgal.AF was found to readily be propagated in 21 IB cells. As 
15 assayed by protein concentration, CsCl-purified stocks of either Ad5.Bgal.AF or 

Ad5. Bgal.wt contained similar numbers of viral particles (Table 1), and the 
particles appeared to band normally on CsCl gradients. However, infectivity of 
the Ad5.Bgal.AF particles was lower than the Ad5. Bgal.wt control, as indicated 
by an increased particle/PFU ratio (Table 1). This is likely due to a reduced 
20 amount of fiber protein incorporated into mutant particles during growth in the 



Virus 
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purified 
prepn 


Cell 
line 


Particles/ 
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PFU/ml b 
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Fiber source 


Ad5.pgal wt 
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211B 


7.4 x 10 11 


7.5 x 
10 10 
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Ad 

chromosome 
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211B 


3.0 xlO 11 


5.0 x 10 9 
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Ad 
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211B 


7.7 x 10 11 


3.5 x 10 8 


2,200 


Packaging 
cells 
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4 


211B 


1.9 x 10 12 


2.3 x 10 9 


808 


Packaging 
cells 




5 


293 


4.5 x 10 11 


9.5 x 10 s 


47,400 


None 




6 


293 


3.4 x 10 n 


3.5 x 10 7 


9,700 


None 



Calculated from viral protein concentration (lug of protein = 4 x 10 9 particles). 
b Assayed by plaquing on 21 IB cells. 

Table 3 



*Particle numbers and infectious titers of representative adenovirus preps . Each line 
5 represents a single CsCl-purified preparation of the indicated virus. Particle numbers were 

calculated from viral protein concentration ( 1 \xg protein = 4 x 1 0 9 particles) . Pfu was assayed 
by plaquing on 21 IB cells (see above). 

21 IB cells (see below). Ad5.Bgal.AF was also found to plaque more slowly than the 
control virus. When plated on 21 IB cells, Ad5.Bgal.wt plaques appeared within 5-7 
10 days, while plaques of Ad5.figal.AF continued to appear until as much as 15-18 days 

post infection. Despite their slower formation, the morphology of Ad5.Bgal.AF 
plaques was essentially normal. 



4. Production of Fiberless Ad5.Bgal.AF Particles 



As Ad5.BgaLAF represents a true fiber null mutation and its stocks are free of 
15 helper virus, the fiber mutant phenotype was readily investigated. A single round of 

growth in cells (such as 293) which do not produce fiber generating a homogeneous 
preparation of fiberless Ad allowed for the determination of whether such particles 
would be stable and/or infectious. Either Ad5.Bgal.wt or Ad5.Bgal.AF was grown in 
293 or 21 IB cells, and the resulting particles purified on CsCl gradients as previously 
20 described, Ad5.Bgal.AF particles were readily produced in 293 cells at approximately 
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the same level as the control virus and behaved similarly on the gradients, indicating 
that there was not a gross defect in morphogenesis of fiberless capsids (Table 1). 

As shown in Figure 18, particles of either virus contained similar amounts of 
penton base regardless of the cell type in which they were grown. This demonstrated 
that fiber is not required for assembly of the penton base complex into virions. 
However, as predicted, the Ad5.BgaI.AF particles produced in 293 cells did not contain 
fiber protein. 21 IB-grown Ad5.J3gal.AF also contained less fiberthan the Ad5.figal.wt 
control virus (Figure 18). Importantly, the infectivities of the different viral 
preparations on epithelial cells (Table 1) correlated with the amount of fiber protein 
present. The fiberless Ad particles were several thousand-fold less infectious than the 
first-generation vector control on a per-particle basis, while infectivity of 2 1 1 B-grown 
Ad5.13gal.AF was only 50-100 fold less than that of Ad5.BgaLwt. These studies 
confirmed fiber's crucial role in 
infection of epithelial cells via CAR binding. 

5. Composition and Structure of the Fiberless Ad5.figal.AF 
Particles 

The proteins contained in particles of 293 -grown Ad5.13gal.AF were compared 
to those in Ad5.Bgal.wt, to determine whether proteolysis or particle assembly was 
defective in this fiber null mutant (data not shown). The overall pattern of proteins in 
the fiberless particles was observed to be quite similar to that of a first-generation 
vector, with the exception of reduced intensity of the composite band resulting from 
both proteins Ilia and IV (fiber) (data not shown). The fiberless particles also had a 
reduced level of protein VII. Although substantial amounts of uncleaved precursors 
to proteins VI, VII, and VIII were not seen, it is possible that the low-molecular weight 
bands migrating ahead of protein VII represent either aberrantly cleaved viral proteins 
or their breakdown products. 

Cryo-electron microscopy was used to more closely examine the structure of 
the 293 grown Ad5.Bgal.AF and of AdSBgal.wt. The fiber, which consists of an 
extended stalk with a knob at the end, was faintly visible in favorable orientations of 
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wild-type Ad5 particles, but not in images of the fiberless particles (data not shown). 
Filamentous material likely corresponding to free viral DNA was seen in micrographs 
of fiberless particles. This material was also present in micrographs of the 
first-generation control virus, albeit at much lower levels. 

Three-dimensional image reconstructions of fiberless and wild-type particles at 
-20 A resolution showed similar sizes and overall features, with the exception that 
fiberless particles lacked density corresponding to the fiber protein. The densities 
corresponding to other capsid proteins, including penton base and proteins Ilia, VI, 
and IX, were comparable in the two structures. This confirms that absence of fiber does 
not prevent assembly of these components into virions. The fiber was truncated in the 
wild-type structure as only the lower portion of its flexible shaft follows icosahedral 
symmetry. The RGD protrusions on the fiberless penton base were angled slightly 
inward relative to those of the wild-type structure. Another difference between the two 
penton base proteins was that there is a -30 A diameter depression in the fiberless 
penton base around the five-fold axis where the fiber would normally sit. The Ad5 
reconstructions confirm that capsid assembly, including addition of penton base to the 
vertices, is able to proceed in the complete absence of fiber. 

6. Integrin-Dependent Infectivity of Fiberless Ad5.figal.AF 
Particles 

While attachment via the viral fiber protein is a critical step in the infection of 
epithelial cells, an alternative pathway for infection of certain hematopoietic cells has 
been described. In this case, penton base mediates both binding to the cells (via J32 
integrins) and internalization (through interaction with avintegrins). Particles lacking 
fiber might therefore be expected to be competent for infection of these cells, even 
though on a per-particle basis they are several thousand-fold less infectious than normal 
Ad vectors on epithelial cells. 

To investigate this, THP-1 monocytic cells were infected with Ad5.J3gal.wt or 
with Ad5.Bgal.AF grown in the absence of fiber. Infection of TUP- 1 cells was assayed 
by infecting 2 x 10 5 cells at the indicated m.o.i. in 0.5 ml of complete RPML 
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Forty-eight hours post-infection, the cells were fixed with glutaraldehyde and stained 
with X-gal, and the percentage of stained cells was determined by light microscopy. 

The results of the infection assay showed that the fiberless particles were only 
a few-fold less infectious than first-generation Ad on THP-1 cells (Figure 19 A). In 
contrast to this, very large differences were seen in plaquing efficiency on epithelial 
(2 1 IB) cells (Table 1 ). Infection of THP- 1 cells by either Ad5.Bgal.AF or Ad5.J3gal.wt 
was not blocked by an excess of soluble recombinant fiber protein, but could be 
inhibited by the addition of recombinant penton base (Figure 19B). These results 
indicate that the fiberless Ad particles use a fiber-independent pathway to infect these 
cells. Furthermore, the lack of fiber protein did not prevent AdS.BgalAF from 
internalizing into the cells and delivering its genome to the nucleus, demonstrating that 
fiberless particles are properly assembled and are capable of uncoating. 

The foregoing results with the recombinant viruses thus produced indicates that 
they can be used as gene delivery tools both in cultured cells and in vivo as described 
more fully in the Examples. For example, for studies of the effectiveness and relative 
immunogenicity of multiply-deleted vectors, virus particles are produced by growth in 
the packaging lines described in Example 1 and are purified by CsCl gradient 
centrifugation. Following titering, virus particles are administered to mice via systemic 
or local injection or by aerosol delivery to lung. The LacZ reporter gene allows the 
number and type of cells which are successfully transduced to be evaluated. The 
duration of transgene expression is evaluated in order to determine the long-term 
effectiveness of treatment with multiply-deleted recombinant adenoviruses relative to 
the standard technologies which have been used in clinical trials to date. The immune 
response to the improved vectors described here is determined by assessing parameters 
such as inflammation, production of cytotoxic T lymphocytes directed against the 
vector, and the nature and magnitude of the antibody response directed against viral 
proteins. 

Versions of the vectors which contain therapeutic genes such as CFTR for 
treatment of cystic fibrosis or tumor suppressor genes for cancer treatment are 
evaluated in the animal system for safety and efficiency of gene transfer and expression. 
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Following this evaluation, they are used as experimental therapeutic agents in human 
clinical trials. 

B. Retargeting of Adenoviral Gene Delivery Vectors by Producing 
Viral Particles Containing Different or Altered Fiber Proteins 

As the specificity of adenovirus binding to target cells is largely determined by 
the fiber protein, viral particles that incorporate modified fiber proteins or fiber proteins 
from different adenoviral serotypes (pseudotyped vectors) have different specificities. 
Thus, the methods of expression of the native Ad5 fiber protein in adenovirus 
packaging cells as described above is also applicable to production of different fiber 
proteins. 

In one aspect of invention, chimeric fiber proteins are produced according to 
the methods of Stevenson etaL,J. Virol, 69:2850-2857 (1995). The authors showed 
that the determinants for fiber receptor binding activity are located in the head domain 
of the fiber and that isolated head domain is capable of trimerization and binding to 
cellular receptors. The head domains of adenovirus type 3 (Ad3) and Ad5 were 
exchanged in order to produce chimeric fiber proteins. Similar constructs for encoding 
chimeric fiber proteins for use in the methods of this invention are contemplated. Thus, 
instead of the using the intact Ad5 fiber-encoding construct prepared in Example 1 as 
a complementing viral vector in adenoviral packaging cells, the constructs described 
herein are used to transfect cells along with E4 and/or El -encoding constructs. 

Briefly, full-length Ad5 and Ad3 fiber genes were amplified from purified 
adenovirus genomic DNA as a template. The Ad5 and Ad3 nucleotides sequences are 
available with the respective GenBank Accession Numbers Ml 8369 and Ml 24 11. 
Oligonucleotide primers are designed to amplify the entire coding sequence of the full- 
length fiber genes, starting from the start codon, ATG, and ending with the termination 
codon TAA. For cloning purposes, the 5 f and 3* primers contain the respective 
restriction sites BamHI and NotI for cloning into pcDNA plasmid as described in 
Example 1 A. PCR is performed as described above. 
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The resultant products are then used to construct chimeric fiber constructs by 
PCRgene overlap extension, as described by Horton etal.BioTechniques, 5:525-535 
(1990). The Ad5 fiber tail and shaft regions (5TS; the nucleotide region encoding 
amino acid residue positions 1 to 403) are connected to the Ad3 fiber head region (3H; 
the nucleotide region encoding amino acid residue positions 136 to 3 19) to form the 
5TS3H fiber chimera. Conversely, the Ad3 fiber tail and shaft regions (3TS; the 
nucleotide region encoding amino acid residues positions 1 to 135) are connected to 
the Ad5 fiber head region (5H; the nucleotide region encoding the amino acid residue 
positions 404 to 581) to form the 3TS5H fiber chimera. The fusions are made at the 
conserved TLWT (SEQ ID NO: 19) sequence at the fiber shaft-head junction. 

The resultant chimeric fiber PCR products are then digested with BamHI and 
NotI for separate directional ligation into a similarly digested pcDNA 3.1. The TPL 
sequence is then subcloned into the BamHI as described in Example 1 A for preparing 
an expression vector for subsequent transfection into 211 cells as described above or 
into the alternative packaging cell systems as previously described. The resultant 
chimeric fiber construct-containing adenoviral packaging cell lines are then used to 
complement adenoviral delivery vectors as previously described. Other fiber chimeric 
constructs are obtained using a similar approach with the various adenovirus serotypes 
known. 

In an alternative embodiment, the methods of this invention contemplate the use 
of the modified proteins including novel epitopes as described by Michael et aL, Gene 
Therapy, 2:660-668 (1995) and in International Publication WO 95/26412, the 
disclosures of which are incorporated by reference herein. Both publications describe 
the construction of a cell-type specific therapeutic viral vector having a new binding 
specificity incorporated into the virus concurrent with the destruction of the 
endogenous viral binding specificity. In particular, the authors described the 
production of an adenoviral vector encoding a gastrin releasing peptide (GRP) at the 
3' end of the coding sequence of the Ad5 fiber gene. The resulting fiber-GRP fusion 
protein was expressed and shown to assemble functional fiber trimers that were 
correctly transported to the nucleus of HeLa cells following synthesis. 
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Based on the teachings in the paper and International Publication, similar 
constructs are contemplated for use in the complementing adenoviral packaging cell 
systems of this invention for generating new adenoviral gene delivery vectors that are 
targetable, replication-deficient and less immunogenic. Heterologous ligands 
contemplated for use herein to redirect fiber specificity range from as few as 10 amino 
acids in size to large globular structures, some of which necessitate the addition of a 
spacer region so as to reduce or preclude steric hindrance of the heterologous ligand 
with the fiber or prevent trimerization of the fiber protein. The ligands are inserted at 
the end or within the linker region. Preferred ligands include those that target specific 
cell receptors or those that are used for coupling to other moieties such as biotin and 
avidin. 

A preferred spacer includes a short 12 amino acid peptide linker composed of 
a series of serines and alanine flanked by a proline residue at each end. One of ordinary 
skill in the art is familiar with the preparation of linkers to accomplish sufficient protein 
presentation and for altering the binding specificity of the fiber protein without 
compromising the cellular events that follow viral internalization. Moreover, within the 
context of this invention, preparation of modified fibers having ligands positioned 
internally within the fiber protein and at the carboxy terminus as described below are 
contemplated for use with the methods described herein. 

The preparation of a fiber having a heterologous binding ligand is prepared 
essentially as described in the above-cited paper. Briefly, for the ligand of choice, site- 
directed mutagenesis is used to insert the coding sequence for a linker into the 3* end 
of the Ad5 fiber construct in pCLF as prepared in Example 1. 

The 3' or antisense or mutagenic oligonucleotide encodes a preferred linker 
sequence of ProSerAlaSerAlaSerAlaSerAlaProGlySer (SEQ ID NO: 20) followed by 
a unique restriction site and two stop codons, respectively, to allow the insertion of a 
coding sequence for a selected heterologous ligand and to ensure proper translation 
termination. Flanking this linker sequence, the mutagenic oligonucloetide contains 
sequences that overlap with the vector sequence and alio its incorporation into the 
construct. Following mutagenesis of the pCLF sequence adding the linker and stop 
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codon sequences, a nucleotide sequence encoding a preselected ligand is obtained, 
linkers corresponding to the unique restriction site in the modified construct are 
attached and then the sequence is cloned into linearized corresponding restriction site. 

The resultant fiber-ligand construct is then used to transfect 211 or the 
alternative cell packaging systems previously described to produce complementing viral 
vector packaging systems for use with the methods of this invention. 

In a further embodiment, intact fiber genes from different Ad serotypes are 
expressed by 21 1 cells or an alternative packaging system as previously described. 
A gene encoding the fiber protein of interest is first cloned to create a plasmid 
analogous to pCLF, and stable cell lines producing the fiber protein are generated as 
described above for Ad5 fiber. The adenovirus vector described which lacks the fiber 
gene is then propagated in the cell line producing the fiber protein relevant for the 
purpose at hand. As the only fiber gene present is the one in the packaging cells, the 
adenoviruses produced contain only the fiber protein of interest and therefore have the 
binding specificity conferred by the complementing protein. Such viral particles are 
used in studies such as those described above to determine their properties in 
experimental animal systems. 

Example 3 
Deposit of Materials 

The following cell lines and plasmids have been deposited on September 25, 
1996, with the American Type Culture Collection, 10801 University Blvd, 
Manassas, Virginia, USA (ATCC) under the provisions of the Budapest Treaty on 
the International Recognition of the Deposit of Microorganisms for the Purpose of 
Patent Procedure and the Regulations thereunder (Budapest Treaty): Plasmid 
pE4/Hygro (accession number 97739), Plasmid pCLF (accession number 97737), 
21 1 Cell Line (accession number CRL-12193) and 21 1A Cell Line (accession 
number CRL-12194) 
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The following virus, Ad5.Pgal.AF, wasdeposited on January 15, 1999, with the 
ATCC as listed above and provided with accession number VR2636. 

Additionally, plasmids pDV60, pDV67, pDV69, pDV80 and pDV90 were 
deposited at the ATCC on January 5, 2000 and provided with accession numbers PTA- 
1 144, PTA-1 145, PTA-1 146, PTA-1 147 and PTA-1 148 respectively. 

Example 4 

Complementation of Fiber-Defective and Fiber-Modified Virus 

The native fiber protein is a homotrimer (Henry L J. et al , J. Virol 
(55:5239-5246 (1994)), and trimerization is essential for assembly of the penton/fiber 
complex (Novelli A et al, 1 Biol Chem, 266:9299-9303 (1991)). To assess the 
multimeric structure of the recombinant fiber protein produced by the cell lines, cells 
were labeled with 50 nCi/ml [ 35 S] Translabel (ICN) for two hours at 37°C, lysed in 
RIP A buffer, and fiber protein was immunoprecipitated as described (Harlow E et a., 
Antibodies. Cold Spring Harbour Laboratory, Cold Spring Harbor (1988). Immune 
complexes were collected on Protein A-Sepharose beads (Pierce), extensively washed 
with RIP A buffer, and incubated at room temperature in 0. 1 M triethylamine, pH 1 1 . 5 
to release bound fiber protein. A portion of the precipitated fiber was electrophoresed 
on a 8% SDS-PAGE gel under denaturing (1% SDS in loading buffer, samples boiled 
for 5 minutes) or semi-native (0.1% SDS in loading buffer, samples not heated) 
conditions. 

As seen in Fig. 13, lines 21 1A, 21 IB, and 21 1R, but not the control 293 cells, 
expressed an immunologically reactive protein which migrated at the predicted 
molecular weight for trimer (186 kD) under seminative conditions and for monomer 
(62 kD) under denaturing conditions. The behavior of the precipitated fiber was 
indistinguishable from that of purified baculovirus-produced recombinant Ad2 fiber 
(Wickham T et al, Cell 73:309-3 19 (1993)) (the 58 kD Ad2 and 62 kD Ad5 fibers 
have very similar mobilities under these conditions). 

To determine whether the fiber-expressing lines could support the growth of a 
fiber-defective adenovirus, we performed one-step growth experiments using the 
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temperature-sensitive fiber mutant Ad H5&142 (the gift of Harold Ginsberg). At the 
restrictive temperature (39.5 °C), this mutant produces an underglycoslyated fiber 
protein which is not incorporated into mature virions (Chee-Sheung C. C et al 7 J. 
Virol 42: 932-950 (1982)). This results in the accumulation of non-infectious viral 
particles. We asked whether the recombinant fiber protein expressed by our cell lines 
could complement the H5M42 defect and rescue viral growth. 

Cell lines 293, 21 1 A, 21 IB and 21 1R (2 x 10 6 cells/sample) were infected with 
H5/5142 at 10 pfu/cell. 48 hours later, cells were detached with 25 mM EDTA and 
virus was harvested by four rapid freeze-thaw cycles. Debris was removed by a 10 
minute spin at 1500 x g, and viral titers determined by fluorescent focus assay (Thiel 
J.F et al. 9 Proc. Soc. Exp, Biol Med, 725:892-895 (1967)) on SW480 cells with a 
polyclonal anti-penton base Ab (Wickham T et al, Cell 73:309-319 (1993)). As 
shown in Fig. 14, the fiber mutant virus replicated to high titers in 293 cells at 32.5 °C 
(the permissive temperature), but to a much lower extent at the restrictive temperature 
of39.5°C. The fiber-producing packaging lines 211 A, 21 IB, or 21 1R supported virus 
production at 39° C to levels within two- to three-fold of those seen at the permissive 
temperature in 293 cells, indicating that these cells provided partial complementation 
of the fiber defect. 

Interestingly, virus yields from the fiber-producing cell lines were also somewhat 
higher than those from 293 cells at 32.5 °C (the 'permissive' temperature). This 
suggests that fiber produced by the tsl42 virus may be partially defective even at the 
permissive temperature. Alternatively, a non-specific increase in adenoviral titer could 
result when viruses are grown in the packaging cells, by a mechanism not involving 
fiber complementation. However, it was found that viruses with wild type fiber genes 
(such as Ad.RSVPgal) replicate to identical levels either in our packaging lines or in 
293 cells (data not shown). Taken together, these results demonstrate that the 
observed increase in H5M42 growth is due to specific complementation of the fiber 
mutation. 

Even in the fiber-expressing cell lines, the fiber mutant grows to higher titers at 
32°C than at 39.5 °C. This incomplete complementation may be due to the packaging 
lines' expression of fiber at a level somewhat below that seen in a wild-type infection 
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(data not shown). A recent study reported an E4-deleted vector which coincidental^ 
reduced fiber protein expression, resulting in a large reduction in the titer of virus 
produced (Brough et al y J. Virol 70:6497-6501 (1996)), Another possibility is that 
the defective *sl42 fiber protein produced at the restrictive temperature might form 
complexes with some of the wild type protein produced by the cells and prevent its 
assembly into particles. 

Although the fiber proteins of different Ad serotypes differ in the length of their 
shaft domains and in their receptor-binding knob domains, the N-terminal regions 
responsible for interaction with the viral penton base are highly conserved Arnberg N 
et al y Virology 227:239-244 (1997)) (Figure 15A). This suggests that fibers from 
many viral serotypes, with their different cell-binding specificities, may be amenable for 
use in producing gene delivery vectors. 

In order to determine whether the recombinant Ad5 fiber produced by the 
packaging cells could be incorporated into particles of another adenovirus serotype, 
adenovirus type 3 was grown either in fiber-producing cell lines or in 293 cells. Viral 
particles were purified by two sequential centrifugations (3 h at 111,000 x g) on 
preformed 1 5-40% CsCl gradients to remove soluble cellular proteins and then dialyzed 
extensively against lOmMTris-HCl, pH8.1, 150mMNaCl, 10% glycerol. Ad5 fiber 
protein was detected by immunoblotting using the polyclonal anti-fiber serum, followed 
by detection with a horseradish peroxidase-conjugated goat anti-rabbit antibody 
(Kirkegaard and Perry Laboratories) and the ECL chemiluminescence substrate 
(Amersham). The purified Ad3 particles contained Ad5 fiber protein after a single 
passage through a fiber-expressing cell line but not after passage through 293 cells 
(Figure 1 5B) . Previous work has demonstrated that Ad2 fiber is capable of interacting 
in vitro with Ad3 penton base (Fender et al , Nature Biotech. 75:52-56(1 997)), and our 
result demonstrates that the type 5 fiber protein produced by the cells is capable of 
assembling into complete Ad3 particles. 

A vector based on Ad5 but containing the gene for the Ad7 fiber protein has been 
described (Gall J. et al 9 1 Virol. 70:2116-2123 (1996)), as well as Ads containing 
chimeric fiber genes (Krasnykh et al, J, Virol 70:6839-6846 (1996) and Stevenson 
et al, 1 Virol 62:2850-2857 (1995)). Chimeric Ad5/Ad3 vectors have also been 
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reported (Stevenson, S. et al., J. Virol 77:4782-4790, (1997). Addition of a short 
peptide linker to the fiber in order to confer binding to a different cellular protein has 
also been reported (Michael et al y Gene Therapy 2:660-668 (1995). By using 
packaging technology such as that presented here, Ad vectors equipped with different 
fiber proteins may be produced simply by growth in cells expressing the fiber of 
interest, without the time-consuming step of generating a new vector genome for each 
application. 

Replacing or modifying the fiber gene in the vector chromosome would also 
require that the new fiber protein bind a receptor on the surface of the cells it which it 
is to be grown. The packaging cell approach will allow the generation of Ad particles 
containing a fiber which can no longer bind to its host cells, by a single round of 
growth in cells expressing the desired fiber gene. This will greatly expand the 
repertoire of fiber proteins which can be incorporated into particles, as well as 
simplifying the process of retargeting gene delivery vectors. 

Finally, a novel fiber-independent pathway of infection has recently been 
described in hematopoietic cells, in which penton base provides the initial virus-cell 
interaction by binding to integrin a m b 2 (Huang S. et al, 1 Virol 70: 4502-4508 
(1996)). This suggests that viral particles lacking fiber protein may be useful in 
targeting gene delivery to specific cell types via this pathway. 

Example 5 
Preparation of Alternative TPLs 

The present invention contemplates the use of tripartite leader sequences (TPLs) 
that are useful in enhancing the expression of complementing adenoviral proteins, 
particularly fiber protein, for use in preparing an adenoviral gene delivery vector. One 
preferred TPL is the complete Ad5 tripartite leader contained in complementing vectors 
such as pDV67 and pDV69, both of which are prepared as described below. The 
complete Ad5 TPL was constructed by assembling PCR fragments. First, the third 
TPL exon (exon 3) (nt 9644-9731 of the Ad5 genome) was amplified from Ad5 
genomic DNA using the synthetic oligonucleotide primers 
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5 ' CTC AAC AATTGTGGATCCGTACTCC3 '(SEQ ID NO: 28) and 
5 ' GTGCTC AGCAGATCTTGCGACTGTG3 ' (SEQ ID NO: 29). The resulting 
product was cloned to the BamHI and Bglll sites of pAEl Sp 1 a (Microbix Biosystems) 
using novel sites in the primers (shown in bold) to create plasmid pDV52. A fragment 
corresponding to the first TPL exon (exon 1), the natural first intron (intron 1), and the 
second TPL exon (exon 2) (Ad5 nt 6049-7182) was then amplified using primers 
5 ' GGCGCGTTCGGATCC ACTCTCTTCC3 1 (SEQ ID NO:30) and 5'CTA 
CATGCT AGGCAGATCTCGTTCGGAG3 ' (SEQ ID NO: 31), and cloned into the 
B amHI site of pD V5 2 (again using novel sites in the primers) to create pD V5 5 . This 
plasmid contains a 1.2 kb BamHI/Bglll fragment consisting of the first TPL exon, the 
natural first intron, and the fused second and third TPL exons. The nucleotide 
sequence of the complete TPL containing the noted 5* and 3' restriction sites is shown 
in SEQ ID NO: 32 with the following nucleotide regions identified: 1 -6 nt BamHI site; 
7-47 nt first leader segment (exon 1); 48-1068 nt natural first intron (intron 1); 1069- 
1 140 nt second leader segment (exon 2); 1 141-1 146 nt fused BamHI and Bglll sites; 
1 147-1234 nt third leader segment (exon 3); and 1235-1240 nt Bglll site. 

TPLs fragments containing two of the three exons, exons in non-native order, or 
containing either the first or second TPL intron are also constructed for use in 
preparing complementing plasmids for use in the methods of the present invention. 
Briefly, DNA fragments containing any combination of 2 TPL exons can be 
constructed as follows: Exon 1 is amplified from genomic DNA as prepared above by 
usingthe oligonucleotides 5 ' GGCGCGTTCGGATCC ACTCTCTTCC3' (SEQ ID NO: 
33) and 5 ' GGGAGTAGATCTCCCAACAG3 ' (SEQ ID NO: 34). Exon 2 is 
similarly amplified from the same genomic DNA using oligonucleotides 
5'CCCTTTTTTTTGGATCCCTCGCGG3> (SEQ ID NO: 35) and 
5 ' CT ACATGCTAGGCAGATCTCGTTCGGAG3 ' (SEQ ID NO: 36). Exon 3 is 
amplified using the oligonucleotides 5'CTCAACAATTGTTGGATCCGTACTCC3 ' 
(SEQ ID NO: 37) and 5' GTGCTC AGCAGATCTTGCGACTGTG3 ' (SEQ ID NO: 
38). 

The amplified exons are ligated together in any desired number and/or order by 
virtue of the unique BamHI and Bglll restriction sites (bold) in the primers for 
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subsequent ligation into a construct analogous to pD V67, prepared as described below, 
for expression of viral structural genes. 

Similarly, a fragment consisting of the first TPL exon (exon 1), the native first 
intron (intron 1), and the second TPL exon (exon 2) is produced by amplification from 
Ad5 genomic DNA with the oligonucleotide pair 5 ' GGCGCGTTCGGATCC 
ACTCTCTTCC3' (SEQ ID NO: 39) and 5'CTACATGCTAGGCAGATCT 
CGTTCGGAG3 ' (SEQ ID NO: 40). Finally, a fragment consisting of the second TPL 
exon (exon 2), the native second intron (intron 2), and the third TPL exon (exon 3) is 
produced by amplification using the oligonucleotides 5 'CCCTTTTTTTTGGATCC 
CTCGCGG3' (SEQ ID NO: 41) and 5 5 GTGCTCAGCAGATCTTGCGACTGTG3 ' 
(SEQ ID NO: 42). Either of the intron- containing fragments are used either alone or 
in combination with another TPL fragment(s) in constructs analogous to pDV67. 
Introns in addition to adenoviral intron 1 used herein that have been shown to increase 
the expression of recombinant proteins when included in expression constructs include 
SV40 VP1 intron, rabbit P-globin intron among others. The use of these alternative 
intron sequences are contemplated for use in preparing a TPL in the present invention. 

Example 6 

Preparation and Use of Adenoviral Packaging Cell Lines 
Containing Plasmids Containing Alternative TPLs 

Plasmids were first constructed as described below that contained TPLs are 
described above. The resultant plasmids containing different selectable markers such 
as neomycin or zeocin were then used to prepare stable cell lines for use as 
complementing vectors for preparing adenoviral vectors for use in the present 
invention. In a preferred embodiment, the resulting cell lines represent improvements 
over preexisting fiber-complementing cell lines in that fiber expression is enhanced with 
the use of alternative TPLs. 



A. pDV60 
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pDV60 was constructed by inserting this TPL cassette of SEQ ID NO: 32 into 
the BamHI site upstream of the Ad5 fiber gene in pcDNA3/Fiber, a neomycin 
selectable plasmid, prepared as described in Example 1 and also as described by Von 
Seggerne/a/., J. Gen Virol, 79: 1461 (1998). The nucleotide sequence of pDV60 is 
listed in SEQ ID NO: 43. 

B. pDV61 

To construct pD V6 1 , an Asp7 1 8/NotI fragment containing the CM V promoter, 
partial Ad5 TPL, wildtype Ad5 fiber gene, and bovine growth hormone terminator was 
transferred from pCLF, prepared as described in Example 1 and also as described by 
Von Seggern et al, J. Gen Virol, 79: 1461 (1998), to a zeocin selectable cloning 
vector referred to as pCDNA3 . 1/Zeo (+) (commerically available from Invitrogen and 
the sequence is also available). 

C. pDV67 

In an analogous process, pDV67 containing complete TPL was constructed by 
transferring an Asp 718/XbaI fragment from pDV60 to the pcDNA3.1/Zeo(+) 
backbone. The nucleotide sequence of pDV67 is listed in SEQ ID NO: 44. 

D. pDV69 

To prepare pDV69 containing a modified fiber protein, the chimeric Ad3/Ad5 
fiber gene was amplified from pGEM5TS3H (Stevenson et al, J. Virol, 69: 
2850-2857, 1 995)) using the primers 5 ' ATGGGAT 
CAAGATGAAGCGCGCAAGACCG3 ' (SEQ ID NO: 45) and 
5'CACTATAGCGGCCGCATTCTCAGTCATCTT3' (SEQ ID NO: 46), and cloned 
to the BamHI and NotI sites of pcDNA3.1/Zeo(+) via novel BamHI and NotI sites 
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engineered into the primers to create pDV68. Finally, the complete TPL fragment 
described above was then added to the unique BamHl site of pDV68 to create pDV69. 
The nucleotide sequence of pDV69 is listed in SEQ ID NO: 47. 

E, Preparation of Stable Adenovirus Packaging Cell Lines 

El -2a S8 cells are derivatives of the A549 lung carcinoma line (ATCC # CCL 
185) with chromosomal insertions of the plasmids pGRE5-2.El (also referred to as 
GRE5-El-SV40-Hygro construct and listed in SEQ ID NO: 48) and pMNeoE2a-3 . 1 
(also referred to as MMTV-E2a-SV40-Neo construct and listed in SEQ ID NO: 49), 
which provide complementation of the adenoviral El and E2a functions, respectively. 
This line and its derivatives were grown in Richter's modified medium (BioWhitaker) 
+ 10% FCS. El-2a S8 cells were electroporated as previously described (Von 
Seggernet al. y J. Gen Virol, 79: 1461 (1998)) with pDV61, pDV67, orwithpDV69, 
and stable lines were selected with zeocin (600 jig/ml). The cell line generated with 
pD V6 1 is designated 60 1 . The cell line generated with pD V67 is designated 63 3 while 
that generated with pDV69 is designated 644. Candidate clones were evaluated by 
immunofluorescent staining with a polyclonal antibody raised against the Ad2 fiber. 
Lines expressing the highest level of fiber protein were further characterized. 

For the S8 cell complementing cell lines, to induce El expression, 0.3 nM of 
dexamethasone was added to cell cultures 1 6-24 hours prior to challenge with virus for 
optimal growth kinetics. For preparing viral plaques, 5 X 1 0 5 cells/well in 6 well plates 
are prepared and pre-induced with the same concentration of dexamethasone the day 
prior to infection with 0.5 included at a final concentration in the agar overlay after 
infection. 

F. Development of Cell Lines for Complementation of E17E2a Vectors 

This example shows the construction of S.8 cells 

The Adenovirus 5 genome was digested with Seal enzyme, separated on an 
agarose gel, and the 6,095 bp fragment comprising the left end of the virus genome was 
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isolated The complete Adenovirus 5 genome is registered as Genbank accession 
#M73260, incorporated herein by reference, and the virus is available from the 
American Type Culture Collection, Manassas, Virginia, U.S.A., under accession 
number VR-5. The Seal 6,095 bp fragment was digested further with Clal at bp 917 
5 and Bglll at bp 3,328. The resulting 2,41 1 bp Clal to Bglll fragment was purified from 

an agarose gel and ligated into the superlinker shuttle plasmid pSE280 (Invitrogen, San 
Diego, CA), which was digested with Clal and Bglll, to form pSE280-E. (Figure 23). 

Polymerase chain reaction (PCR) was performed to synthesize DNA encoding 
an XhoT and Sail restriction site contiguous with Adenovirus 5 DNA bp 552 through 
10 924. The primers which were employed were as follows: 

O 5' end, Ad5 bp 552-585: 

11 5'-GTCACTCGAGGACTCGGTC-GACTGAAAATGAGACATATTATCTGCCA 
^ CGGACC-3' (SEQ ID NO: 66) 

2 3' end, Ad5 bp 922-891: 

f U15 5'-CGAGATCGATCACCTCCGGTACAAGGTTTGGCATAG-3' (SEQ ID NO: 67) 

|; 3 This amplified DNA fragment (sometimes hereinafter referred to as Fragment A) 

then was digested with Xhol and Clal, which cleaves at the native Clal site (bp 917), 

and ligated to the Xhol and Clal sites of pSE280-E, thus reconstituting the 5( end of 
O the El region beginning 8 bp upstream of the ATG codon. 

20 PCR then was performed to amplify Adenovirus 5 DNA from bp 3,323 through 

4,090 contiguous with an EcoRI restriction site. The primers which were employed 

were as follows: 

5' end, Ad5 bp 3323-3360: 

5-CATGAAGATCTGGAAGGTGCTGAGGTACGATGAGACC-3' (SEQ ID NO:68) 
25 3' end, Ad5 bp 4090-4060: 

5-GCGACTTAAGCAGTCAGCTG-AGACAGCAAGACACTTGCTTGATCCAA 
ATCC-3' (SEQ ID NO: 69) 

This amplified DNA fragment (sometimes hereinafter referred to as Fragment B) 
was digested with Bglll, thereby cutting at the Adenovirus 5 Bglll site (bp 3,382) and 
30 EcoRI, and ligated to the Bglll and EcoRI sites of pSE280-AE to reconstruct the 



- 98- 

complete Ela and Elb region from Adenovirus 5 bp 552 through 4,090. The resulting 
plasmid is referred to as pSE280-El (Figure 23). 

A construct containing the intact El a/b region under the control of the synthetic 
promoter GRE5 was prepared as follows. The intact El a/b region was excised from 
pSE280-El , which was modified previously to contain a BamHI site 3' to the El gene, 
by digesting with Xhol and BamHI. The Xhol to BamHI fragment containing the E 1 a/b 
fragment was cloned into the unique Xhol and BamHI sites of pGRE5-2/EB V (Figure 
4, U.S. Biochemicals, Cleveland, Ohio) to form pGRE5-El (Figure 24). 

Bacterial transformants containing the final construct were identified. Plasmid 
DNA was prepared and purified by banding in CsTFA prior to use for transfection of 
cells. 

Construction of plasmid including Adenovirus 5 E2A sequence. 

The Adenovirus 5 genome was digested with BamHI and Spel, which cut at bp 
21,562 and 27,080, respectively. Fragments were separated on an agarose gel and the 
5,518 bp BamHI to Spel fragment was isolated. The 5,518 bp BamHI to Spel 
fragment was digested further with Smal, which cuts at bp 23,9 12. The resulting 2,3 50 
bp BamHI to Smal fragment was purified from an agarose gel, and ligated into the 
superlinker shuttle plasmid pSE280, and digested with BamHI and Smal to form 
pSE280-E2 BamHI-Smal (Figure 26). 

PCR then was performed to amplify Adenovirus 5 DNA from the Smal site at bp 
23,9 12 through 24,730 contiguous with Nhel and EcoRI restriction sites. The primers 
which were employed were as follows: 
5' end, Ad5 bp 24,732-24,708: 

5'-CACGAATTCGTCAGCGCTTCTCGTCGCGTCCAAGACCC-3' (SEQ ID NO: 
70) 

y end, Ad5 bp 23,912-23,934: 

5 '-C ACCCCGGGGAGGCGGCGGCGACGGGGACGGG-3 ' (SEQ ID NO: 71) 

This amplified DNA fragment was digested with Smal and EcoRI, and ligated 
to the Smal and EcoRI sites of pSE280-E2 Bam-Sma to reconstruct the complete E2a 
region from Ad5 bp 24,730 through 21,562. The resulting construct is pSE280-E2a. 
(Figure 27.) 
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In order to convert the BamHI site at the 3' end of E2a to a Sail site, the E2a 
region was excised from pSE280-E2a by cutting with BamHI and Nhel, and recloned 
into the unique BamHI and Nhel sites of pSE280. (Figure 27.) Subsequently, the E2a 
region was excised from this construction with Nhel and Sail in order to clone into the 
Nhel and Sail sites of the pMAMneo (Clonetech, Palo Alto, CA) multiple cloning site 
in a 5' to 3' orientation, respectively. The resulting construct is pMAMneo E2a. 
(Figure 27). 

Bacterial transformants containing the final pMAMneo-E2a were identified. 
Plasmid DNA was prepared and purified by banding in CsTFA. Circular plasmid DNA 
was linearized at the XmnI site within the ampicillin resistance gene of pMAMneo -E2a, 
and further purified by the phenol/chloroform extraction and ethanol precipitation prior 
to use for transfection of cells. 
Transfection and selection of cells. 

In general, this process involved the sequential introduction, by calcium 
phosphate precipitation, or other means of DNA delivery, of two plasmid constructions 
each with a different viral gene, into a single tissue culture cell. The cells were 
transfected with a first construct and selected for expression of the associated drug 
resistance gene to establish stable integrants. Individual cell clones were established 
and assayed for function of the introduced viral gene. Appropriate candidate clones 
then were transfected with a second construct including a second viral gene and a 
second selectable marker. Transfected cells then were selected to establish stable 
integrants of the second construct, and cell clones were established. Cell clones were 
assayed for functional expression of both viral genes. 

In order to determine the most suitable cell lines for the above-mentioned 
transfections, sequential transfections and selections were carried out with the 
following parental cell types: 

A549 (ATCC Accession No. CCL-185); 

Hep-2 (ATCC Accession No. CCL-23); or 

KB (ATCC Accession No. CCL-17). 

Appropriate selection conditions were established for both G4 1 8 and hygromycin 
B for all three cell lines by standard kill curve determination. 
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Transfection of cell lines with plasmids including El and E2a regions. 

pMAMNeo-E2a was linearized with XmnI with the Amp R gene, introduced into 
cells by transfection, and cells were selected for stable integration of this plasmid by 
G418 selection until drug resistant colonies arose. The clones were isolated and 
screened for E2a expression by staining for E2a protein with a polyclonal antiserum, 
and visualizing by immunofluorescence. E2a function was screened by 
complementation of the temperature-sensitive mutant Ad5tsl25 virus which contains 
a temperature-sensitive mutation in the E2a gene. (Van Der Vliet, et al, J. Virology, 
Vol. 15,pgs. 348-354(1975)). Positive clones expressing the E2a gene were identified 
and used for transfection with the 7 kb EcoRV to XmnI fragment from pGRE5-El 
(Figure 5), which contains the GRE5 promoted Ela/b region plus the hygromycin R 
gene. Cells were selected for hygromycin resistance and assayed for Ela/b expression 
by staining with a monoclonal antibody for the El protein (Oncogene Sciences, 
Uniondale, N.Y.). El function was assayed by ability to complement an El-deleted 
vector. At this point, expression and function of E2a was verified as described above, 
thus establishing the Expression of both Ela/b and E2a in the positive cell clones. 

One of the transfected A549 cell lines showed good Ela/b and E2a expression 
and was selected for further characterization. It was designated the S8 cell line. 

G. Preparation of Adenoviral Vectors Containing AdS.PgaLAF Genome 
in S8 Improved Fiber-Complementing Cell Lines 

To prepare adenoviral vectors containing Ad5.pgal.AF in S8 cells containing 
alternative forms of TPL for enhancing the expression of fiber proteins, the protocol 
as described in Example 2 for preparing Ad5.Pgal.AF in 21 IB cells was followed with 
the exception of pretreatment with 0.3 \xM dexamethasone for 24 hours as described 
above. Thus, viral particles with the wildtype Ad5 fiber protein on their surface and 
containing the fiberless Ad5.Pgal.AF genome were produced in 633 cells. Particles 
produced in 644 cells also contained the fiberless Ad5.Pgal.AF genome, but had the 
chimeric 5T3H fiber protein, with the Ad3 fiber knowb, on their surface. 
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The preparation of the cell lines and demonstration of stable nuclear expression 
of either wild-type Ad5 fiber protein or chimeric Ad5/Ad3 protein is shown in Figure 
20. In the figure, schematic diagrams are presented of the constructs used to generate 
the cell lines as well as immunofluorescence results indicating the presence of expressed 
fiber protein in the nucleus of the cells. 

Thus, these viral preparations, prepared as described herein and in Example 2, 
are useful for targeting delivery of Ad5 . Pgal.AF fiberless genome with either wild-type 
or modified fibers, embodiments of which uses have been previously discussed and as 
further exemplified with the pseudotyping and infectivity results described in Example 
7. 

Example 7 

Pseudotyping and Infectivity of Recombinant Adenoviral Vectors 
Produced with Improved Fiber-Complementing Cell Lines 

A. Pseudotyping of AdS.PgaLAF 

To verify that adenoviral vectors were produced had altered tropisms, viral 
particles were purified from either 633 or 644 cells and were then Western blotted and 
probed with a polyclonal rabbit antibody against the Ad2 fiber (which detects both the 
Ad5 and chimeric 5T3H fiber proteins.). The results are shown in Figure 21 where 
both fiber proteins were detectable confirming pseudotyping. 

B. Infectivity of Cells with 633 or 644 Generated Virus Particles 

The cell lines, 633 or 644, prepared as described above, were infected with the 
indicated number of particles/cell of Ad5.pgal.AF and virus particles produced. Virus 
was then used to infect, as previously described, selected cell lines as shown in Figure 
22, including 21 IB, MRC-5 human fibroblasts, A- 10 rat aortic endothelial cells, and 
THP-1 human monocytic cells. Unbound virus was removed by washing the cells and 
the cells were further incubated at 37 °C for 48 hours. Cells were then fixed with 
glutaraldehyde and stained with X-gal. The percentage of stained cells was then 
determined by light microscopy where all experiments were done in triplicate. 
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The results shown in Figure 22 indicate that adenoviral vectors could be 
retargeted by pseudotyping using packaging cell lines expressing different fiber 
proteins. The data marked with "none" indicates virus grown in 293 cells and lacking 
fiber, while " Ad5 " indicates virus prepared in 633 cells (containing the wild type fiber) 
and Ad3 indicates virus prepared in 644 cells (containing the chimeric 5T3H fiber.) 
Particles containing either fiber were equally infectious on 21 IB cells, while MRC-5 
fibroblasts and THP- 1 cells were more readily infected by virus containing the chimeric 
fiber. The A- 1 0 rat endothelial cells were more readily infected by particles containing 
the wildtype Ad5 fiber protein. 

Example 8 

Targeted Gene Delivery Using Viral Vector 
Particles Lacking Fiber Protein 

An alternative mode of entry for adenoviral infection of hematopoietic cells has 
been described by Huang, et al , J. Virol, (59:2257-2263 (1995) which does not involve 
the fiber protein-host cell receptor interaction. As infection of most other cell types 
does require the presence of fiber protein, vector particles which lack fiber may 
preferentially infect hematopoietic cells, such as monocytes or macrophages. 

To produce a fiber-free adenovirus vector particle, a vector lacking the fiber 
gene as described above in Example 2A but containing a gene of interest for delivery 
is amplified by growth in cells which do not produce a fiber protein, such as the 21 1 
cells prepared in Example 1 or 293 or S8 cells as described herein, thereby producing 
large numbers of particles lacking fiber protein. The recovered fiber-free viral particles 
are then used to deliver the inserted gene of interest following the methods of this 
invention via targeting mechanisms provided by other regions of the adenoviral vector, 
i.e., via the native penton base. 
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A. Construction of an Adenovirus Vector Deleted for El, E3, and 
Fiber, and Carrying a Therapeutic Gene of Interest 

A general method of constructing a fiber-deleted Ad vector containing a 
therapeutic gene of interest (in this example, the Herpes Simplex Virus Thymidine 
Kinase (TK) gene) is described here. Linear viral DNA is isolated from a preparation 
of Ad5.Pgal.AF particles. This DNA is digested with the restriction enzyme Clal, 
which removes the leftmost viral sequences including the left ITR, the packaging 
signals, and part of the SV40-driven p-galactosidase gene. The large Clal fragment 
with the remainder of the fiber-deleted viral genome is then isolated by centrifugation 
on a sodium chloride or sucrose gradient. The plasmid pAdShuttleTK, which contains 
the left part of the Ad chromosome with an RSV-driven TK gene inserted in place of 
the El region, is linearized by digestion with Not! The nucleotide sequence of the 
pAdShuttleTK is shown in SEQ ID NO: 50. The large Clal fragment of Ad5.Pgal.AF 
and the linearized pAdShuttleTK are cotransfected into 21 IB cells, and an infectious 
adenovirus genome is generated by homologous recombination. A virus deleted for 
El, E3, and fiber that contains the TK cassette in the place of the El deletion is thus 
recovered. A virus containing any desired therapeutic gene of interest can be created 
in this manner by replacing the TK gene of the example with the gene of interest. 

An alternative method of constructing a fiber-deleted genome containing a 
therapeutic gene (in this example the retinal degeneration-slow (RDS) gene driven by 
the CMV immediate early promoter) is described here. RDS is a protein expressed in 
photoreceptors, and essential for their proper development and functioning. RDS 
mutations have been implicated in retinal degenerative disorders, and transfer of the 
wildtype RDS gene by means of an Ad vector provides an avenue towards treating such 
disorders. 

A plasmid (pDV50) analogous to pAElBpgal but containing a CMV- driven 
RDS gene was constructed as follows. First, a fragment containing the CMV promoter 
and enhancer was excised from pCHaMIEP by digestion withh Hindlll, filling the 
overhanging ends with the large fragment ofE. coli DNA polymerase 1, ligation of 
BamHI linkers (5'CGCGGATCCCG3' SEQ ID NO: 51) to the blunt ends, and 
digesting with BamHI. The resulting fragment was then ligated into the BamHI site 
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of pAElspla (Mikrobix) to create pDV45, A fragment containing the SV40 
polyadenylation signal was amplified from pSVPgal (Promega) using the 
oligonucleotides 5 ? CTGACAAACTCAGATCTTGTTTATTG3' (SEQ ID NO: 51) 
and 5 ' GTCGACTCTAGAGGATCCAGA3 ' (SEQ ID NO: 52). This fragment was 
ligated into the Bglll site of pDV45 to create pDV46, using the unique BamHI and 
Bglll sites (bold type) in the primers. Finally, the human RDS open reading frame was 
amplified from the plasmid pRDS-T7 using the oligonucleotides 5'CCGGACTCT 
AGATGGCAACCATGGCGCTAC3 ' (SEQ ID NO: 53) and 5'GGA 
GGGGAAGCTTGGCCCTCAGCCAGCCTCT3 ' (SEQ ID NO: 54). This fragment 
was inserted into the Hindlll and Xbal sites of pDV46, again using unique restriction 
sites in the primers, to create pDV50. pDV50 therefore contains a cassette consisting 
of the CMV promoter, the RDS open reading frame, and the SV40 terminator 
sequences inserted in place of the Ad5 El region. 

In a manner analogous to the construction of Ad5 . Pgal. AF, pDV50 and pDV44 
are then co-transfected into 21 IB cells, and an infectious Ad genome (Ad5.RDS.AF) 
is recovered. A fiber-deleted Ad vector containing any desired gene to be expressed 
can be constructed by replacing the RDS gene of this example with the gene of interest. 

Example 9 

Transient Transcomplementation 

Human adenovirus type 5 (Ad5) is being developed as a vector for gene 
therapy. Its ability to deliver therapeutic genes to cells is mediated by the interaction 
of the adenoviral fiber protein with the coxsackievirus-adenoviral receptor (CAR). 
Because a wide-range of cells express CAR, it can be difficult to use adenoviruses to 
deliver genes to specific cell types. One way to address this is to target the virus to a 
particular cell type by genetically altering the fiber. However, the genetic 
manipulations involved in cloning and production of the viruses with altered fibers can 
be time-consuming. Thus it would be a significant advancement in the field of 
adenoviral gene therapy to have a more streamlined system for testing modified fiber 
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genes. An in vitro system has thus been developed that involves infection of tissue 
culture cells with a fiber-deleted Ad and transient co-transfection with a plamid 
directing fiber expression. This system allows one to produce and evaluate such 
modified fibers in the context of a viral particle easily and quickly. In addition this 
system can be envisioned to actually produce therapeutic quantities of adenoviral 
vectors with modified fiber proteins, with such fibers having a new tropism added by 
insertion of a desired ligand into the fiber gene. These fibers may also have the natural 
tropism (i.e. binding to CAR) ablated. 

Plasmids used were pDV60 and pDV55, prepared as described herein. pDV60 
is an p cDN A3. 1 -based expression plasmid that contains the CMV promoter, Ad5 
tripartite leader, an intron, and the Ad5 fiber gene sequence. pDV55 contains no fiber 
gene and serves as the negative control. Ad5.pgal.AF and 21 IB are described above. 
293T cells are identical to 293 cells except they express an integrated SV40 large T 
antigen gene. HDF cells are human diploid fibroblasts. 293 T cells express CAR and 
a Y integrins; HDF cells express a v integrins but no CAR. Transfections with fiber 
expression plasmids were performed with Lipofectamine (GIBCO-BRL) using 20mg 
DNA and 50ml Lipofectamine per 15cm dish. Cells were maintained in DMEM 
supplemented with 10% fetal bovine serum. 

The fiber deletion mutation of Ad5.PgaLAF is complemented in trans by 
passaging virions through 21 IB, a cell line that stably expresses functional Ad5 fiber. 
The present system was designed to complement Ad5.pgal.AF by modified fibers 
expressed from transfected episomal plasmids in 293 T cells. The result is a simplified 
and rapid method to incorporate modified fibers on a viral particle containing the 
Ad5.Pgal.AF genome that does not require propagation of the virus. 

The feasibility of transcomplementation of AdS.pgaLAF with episomal 
fiber-expressing plasmids was demonstrated in the following experiment. 293T cells 
were transfected with one of two plasmids: pDV55, which expresses no fiber or 
pDV60, which expresses wildtype Ad5 fiber. Fiber expression persists for at least six 
days, suggesting that the plasmid is stable as an episome for this amount of time 
Twenty-four hours after transfection, these cells were infected at 2000 particles/cell 
with Ad5 . Pgal.AF passaged through 2 1 IB cells. Seventy-two hours later, a crude viral 
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lysate (CVL) was generated by exposing the cells to five freeze-thaw cycles. Viral 
particles were purified by cesium chloride gradient centrifugation. The resulting virions 
incorporated the fiber expressed from the episomal plasmid, as confirmed by Western 
blots performed with an antibody specific to the Ad5 fiber. 

To demonstrate the functionality of these virions, the transduction efficiency 
was tested. The virions containing no fiber (pDV55) or wildtype fiber (pDV60) were 
applied to monolayers of 293 T and HDF cells at different multiplicity of infection 
(MOFs). 293 T cells express CAR and a v integrinss; HDF cells express a v integrins but 
no CAR. After 2 days, the cells were fixed and stained with X-gal to detect the 
P-galactosidase reporter gene activity. The results showed low transduction efficiency 
for the pDV5 5 -complemented virions in both cell lines. As expected, the 
pDV60-complemented virions transduced 293 T cells to a high degree but did not 
transduce HDF cells, indicating that functional fiber proteins had been expressed from 
the episomal plasmids and incorporated into the virions. This transduction efficiency 
was comparable to or better than that of Ad5.Pgal.AF virions passaged through the 
21 IB cells. 

Episomal plasmid transcomplementation system is suitable for quickly 
expressing and evaluating the properties of modified fibers in the context of a viral 
particle. Episomal plasmid transcomplementation will also be of great utility for 
quickly evaluating a bank of modified fibers for other binding properties, including 
novel tropism and the ablation of the native tropism. In addition to the rapid 
generation and testing of large numbers of modified fibers, there are other advantages 
to the Ad5.pgal.AF transcomplementation system in terms of production and safety. 
Episomal plasmid transcomplementation has the inherent advantage over 
transcomplementation in that it is not necessary to make a stable cell line for every 
modified fiber with which you want to complement Ad5.pgal.AF. Because the 
Ad5.pgal.AF is deleted in El, E3 and fiber, there is an additional gene deletion 
compared to other first generation vectors. This makes t Ad5 . Pgal. AF more replication 
defective and presumably safer. In addition, the presence of the fiber gene deletion 
decreases the opportunity to generate replication-competent virus via recombination 
in the packaging cells. In terms of production a single Ad vector prep could be 
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retargeted to any number of different cell types simply by transfecting the cells with the 
appropriate fiber-expression construct. 

Example 10 

Adenoviral Gene Delivery Vectors Containing the 
Ad37 Fiber Protein 

Adenovirus type 37 (subgroup D) has been associated with infections of the eye 
and genital tract, and may be useful for targeting these tissues or other mucous 
membranes, as well as other cell types. The tropism of Ad37 is due to the binding 
preference of its fiber protein, which binds to an as yet-unidentified receptor located 
on the surface of cells including Chang C, conjunctival epithelial cell line (Huang et aL, 
X Virology 73(4) ;2798-2802 (1 999)). As this fiber directs viral infection to cell types 
different than those infected by Ad5, it is likely to provide a method for targeting gene 
delivery. This example describes construction of packaging cell lines expressing the 
Ad37 fiber protein, and their use in generating particles of a fiber-deleted Ad vector 
(such as Ad5 . Pgal. AF) containing this fiber protein. The fiber protein is attached to the 
viral capsid by binding to the penton base protein through its N-terminus, and the Ad3 7 
fiber was modified in order to make its N-terminal sequence more closely match that 
of the Ad5 protein to ensure that it would efficiently bind the Ad5 penton base in these 
vectors. 

L Construction of an Expression Plasmid for the Ad37 Fiber Protein 
(pDV80) 

This plasmid uses the same regulatory elements as contained in pDV60, 
pDV67, and pDV69 to express the Ad37 fiber in packaging lines, and was constructed 
in two steps. First, the Ad37 fiber open reading frame was amplified from Ad37 
genomic DNA (obtained from the ATCC - accession number VR-929)using the 
synthetic oligonucleotides primers L37 (5* TGT CTT GGA TCC AAG ATG AAG 
CGC GCC CGC CCC AGC GAA GAT GAC TTC 3') (SEQ ID NO: 56) and 37FR (5» 
AAA CAC GGC GGC CGC TCT TTC ATT CTT G 3') (SEQ ID NO: 57). L37 
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contains nucleotides that differ from the Ad37 genomic sequence in order to add an 
unique Bam HI site (bold in the above sequence) and create point mutations to make 
the N-terminal sequence of the fiber more closely match that of the Ad6 protein 
(underlined in the above sequence; the start codon is italicized). 37FR incorporates 
changes to create a unique Not 1 site (bold). The PCR product was inserted into the 
Bam HI and Not 1 sites of pCDNA3.1zeo(+) (Invitrogen) to create pDV78. The 
correct sequence of the Ad37 fiber gene, including the predicted changes, was 
confirmed by sequencing. 

Second, a 1.2 kb Bam Ul/Bgl II fragment containing an adenovirus type 5 
tripartite leader was excised from pDV55 (DVS 1999) and inserted into the Bam HI 
site of pDV78 to create pDV80 (SEQ ID NO:64). 

2. Isolation of Cell Lines Expressing the Ad37 Fiber Protein 

pD V80 DNA was purified using the Qiagen method and electroporated into the 
adeno virus-complementing cell line El -2a S8 (Gorzigliae/a/., J. Virology 70(5)A\13- 
4178 (1996)) as previously described (Von Seggern, et aL, J. Gen. Virol 79:1461- 
1418), and stable clones were selected with 600 |iig/ml zeocin (Invitrogen). Clones 
were expanded and screened for fiber expression by indirect immunofluorescence using 
arabbit polyclonal antibody directed against the Ad37 fiber. Two clones (lines 705 and 
731) that expressed the protein at a uniformly high level were selected for further 
study. 

3. Production of Pseudotyped Ad Vector Particles 

To generate vector particles equipped ('pseudotyped') with the Ad37 fiber 
protein, the Ad37 fiber-expressing 705 cells were infected (approximately 1000 
particles/cell) with Ad5.pgal.AF or with Ad5.GFP,AF. 

Ad5.pgal.AF is prepared as previously described. Ad5.GFP.AF was constructed 
by recombination in bacteria using a modification of the method of (He ? et aL, PNAS 
95:2509-2514 (1998)). First, a fiber-deleted genomic plasmid was constructed by 
removing the fiber gene from pAdEasyl (He, et aL, PNAS 95:2509-2514 (1998)). 
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pDV43 (Von Seggern, etal, J. Virol 73:1601-1608 (1999)) was digested with Pac 
1, the ends blunted by treatment with the large fragment ofE. coli DNA polymerase 
and dNTPs, and the product re-ligated. The resulting plasmid, pDV76, is identical to 
pDV43 except for loss of the Pac 1 site and contains the right end of the Ad5 genome 
with E3 and fiber deletions. A 4.2 kb fragment was amplified from pDV76 using the 
oligonucleotides primers 5' CGC GCT GAC TCT TA GGA CTA GTT TC 3' (SEQ 
ID NO: 58) (including the unique Spe 1 site in the Ad5 genome, bold) and 5' GCG 
CTT AAT TAA CAT CAT CAA TAA TAT ACC TTA TTT T 3' (SEQ ID NO: 59) 
(including a novel Pac 1 site (bold) adjacent to the right Ad5 ITR). This PCR fragment 
therefore contains nucleotides 27 ? 082 to 35,935 of the Ad5 genome with a deletion of 
nucleotides 28133 to 32743 (the E3 and fiber genes), and was used to replace the 
corresponding Spe 1 1 Pac 1 fragment of PAdEasyl to create pDV77. 

E. coli strain BJ5 1 83 was electr op orated with a mixture of pDV77 and Pme 1 - 
linearized pAdTrack as described (He et al, 1998), and DNA was isolated from 
kanamycin-resistant colonies. The resulting plasmid, pDV83, contains a complete El 
E3-, and fiber-deleted Ad5 genome with a CMV-driven GFP reporter gene inserted at 
the site of the El deletion. The full-length Ad chromosome was isolated by Pac 1 
digestion, and transfected to the El - and fiber-complementing 633 cells (Von Seggern 
et al., J. Virol January 2000). The recovered virus was then plaque purified by plating 
on 633 cells and stocks were prepared. 

Ad5-pseudotyped particles were generated by virus growth in 633 cells, which 
express the wild type Ad5 fiber protein. Viral particles were isolated and purified over 
CsCl gradients as previously described (Von Seggern et al., J. Virol 73:1601-1608, 
1999). For analysis of viral proteins, ten \ig of the purified particles were 
electrophoresed on 8-16% gradient gels and the protein transferred to nylon 
membranes. The blot was then probed with rabbit polyclonal antibodies raised against 
recombinant Ad37 fiber or Ad5 fiber or penton base proteins expressed in baculovirus- 
infected cells (Figure 27). 
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ExampIe 11 

Construction of a Fiber Expression Construct Containing a Post- 
Transcriptional Regulatory Element 

Previous studies have shown that mRNA transcribed from the woodchuck 
hepatitis virus (WHV) genome contains an element (the WHV post-transcriptional 
regulatory element, or WPRE) which can increase expression of a protein encoded by 
the mRNA via a post-transcriptional mechanism (Loeb et al, Human Gene Therapy 
70:2295-2305 (1999)). The WPRE has also been shown to enhance expression of 
transgenes delivered by retroviral vectors. (Zufferey, R. et al, J. Virol 75:2886-2892 
(1999)). This example describes the construction of a fiber expression construct 
(pDV90) (SEQ ID NO:65) containing a WPRE as well as the promoter and TPL 
sequences as contained in pDV67. 

A plasmid (pBS/WPRE) which contains the WPRE was obtained from Dr. 
Thomas Hope, Salk Institute. Digestion of pBS/WPRE with Cla\ releases a 600 bp 
fragment containing the WPRE (nt 193-1684 of the WHV genome.) Following C/al 
digestion, the ends of this fragment were filled by treatment with the large fragment of 
E. coli DNA polymerase 1 in the presence of dNTPs to render them blunt. pDV67 
DNA was digested with^Jal (which cuts at a unique site in the transcribed region 
downstream of the Ad5 fiber open reading frame) and the ends filled by the same 
treatment. The filled WPRE fragment was then ligated into the filled Xba 1 site of 
pDV67 to create pDV90. The sequence is found at GenBank accession no. J04514 
(entire genome) inZufferey, R. etaL, J. Virol 73:2886-2892 (1999). 

pDV90 (SEQ ID NO:65) was electroporated into El -2a S8 cells and stable 
clones expressing fiber isolated as described previously for pDV80. 

Example 12 

Construction of an Ad5 Fiber Protein with Heterologous 
Peptide Sequences Inserted in the HI Loop 

The receptor-binding knob domain of the Ad5 fiber protein contains several 
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surface loops which are attractive candidates for the insertion of heterologous peptide 
sequence, as an additional ligand for vector targeting. This example describes the 
construction of a fiber gene which encodes a fiber protein containing a 6 amino acid 
peptide linker in the HI loop, and retains the ability to trimerize. The modified gene 
also contains a unique novel restriction site at the position of the linker insertion to 
facilitate addition of the targeting ligand into the HI loop. 

The Ad5 fiber gene was amplified from Ad5 genomic DNA (ATCC accession 
number VR-5) using the primers Fiber ATG (5' TGA AGC GCG CAA GAC CGT 
CTG AAG 3 ') (SEQ ID NO: 60) and Fiber TAA (5 1 CAT AAC ACT GCA GAT TCT 
TTA TTC TTG G3') (SEQ ID NO: 61), and cloned to the Ndel (filled with the large 
fragment of E. coli DNA polymerase 1 in the presence of dNTPs) and Pst 1 sites of 
pT7-7 using a unique Pst 1 site (bold) in the 'Fiber TAA' oligo. The resulting plasmid, 
pT7/fiber, was digested with Xba 1 and Pst 1 to excise the fiber gene, which was then 
cloned into the Pst 1 and Xba 1 sites of pUC119 to create pUC/fiber. This pUC- 
derived plasmid contains an origin for single-stranded DNA replication and can 
therefore be used to create template DNA for site-directed mutagenesis. 

Site-directed mutagenesis was carried out according to the method of Kunkel 
(T.A. Kunkel, PNAS 52:488-492 (1985)) using the oligonucleotide primer T542 (5' 
GGT ACA CAG GAA ACA GGA GGT TCC GGA GGT GGA GGA GAC ACA 
ACT CC 3') (SEQ ID NO: 62). This results in the addition of 18 new bases 
(underlined) encoding the sequence Gly Gly Ser Gly Gly Gly (SEQ ID NO: 63), with 
a novel BspE 1 site (bold) for the addition of further sequences. The inserted sequence 
is between Thr542 and Gly543 of the Ad5 fiber protein, in the HI loop. The modified 
plasmid is termed pDV14. 

Finally, the modified fiber gene was excised from pDVl 4 by digestion with Pst 
1 and Xba 1 and cloned into the Pst 1 and Xba 1 sites of pGEM3Z (Promega) to create 
pDVl 8. In vitro transcription/translation experiments with pDVl 8 (using the TNT™ 
kit, Promega) demonstrated that the modified fiber gene encoded a protein which was 
capable of trimerizing. 

Alternatively an Ad5 fiber open reading frame (ORF) is amplified from Ad5 
genomic DNA (wildtype Ad5 was purchased from the ATCC) using the 
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oligonucleotides 5' ATG GGA TCC AAG ATG AAG CGC GCA AGA CCG 3 ' (SEQ 
ID NO: 72) and 5' CAT AAC CTG CAG GAT TCT TTA TTC TTG GGC 3' (SEQ 
ID NO: 73), and inserted into the BamHl and /to 1 sites of pGEM-3Zf(+) (Promega 
Inc., Madison, WI) via novel restriction sites (bold type) designed into the primers. 
The 5' oligonucleotide also contains a G to A change 3 nucleotides 5' of the initial ATG 
codon (underlined), designed to improve the consensus for translation initiation. 

Site-directed mutagenesis is performed by the method of Kunkel (Proc. Nat. 
Acad. Sci. 82 :488-492 (1985)), using the synthetic oligonucleotide 5' GGT ACA CAG 
GAA ACA GGA GGT TCC GGA GGT GGA GGA GAC ACA ACT CC 3' (SEQ 
ID NO: 74). This operation introduced sequence (bold type) encoding 6 novel amino 
acids (Gly Gly Ser Gly Gly Gly) immediately following Threonine 542 of the Ad5 fiber, 
and including a unique restriction site for the insertion of further heterologous 
sequences (underlined). The resulting plasmid (pDV18A) contains the modified fiber 
gene under the control of the T7 promoter in the parental pGEM-3Zf(+) and can be 
used for in vitro transcription/translation reactions to produce labeled fiber protein. 

Example 13 

Use of the Fiber Expression System to Retarget 
(Tseudotype') Hybrid Ad/AAV Vectors 

Adenoviral vectors which lack essentially all Ad genes ('helper-dependent 1 or 
'gutless' vectors) have recently been developed. In a modification of this idea, vectors 
('hybrid' vectors) which contain an adeno-associated virus (AAV) or retroviral genome 
have been generated. As AAV and retroviral genomes integrate into the chromosome 
of the target cells, the hybrid Ad/ AAV or Ad/retroviral vectors have the potential to 
provide very long-term gene expression. 

Lieber^a/., (J. Virol 73(1 1):93 14-9324) describe an Ad vector (Ad.AAVl) 
which contains an AAV vector genome (a transgene insert flanked by the AAV 
inverted terminal repeats) inserted into the El region. When 293 cells are infected by 
Ad.AAVl, recombination between the AAV sequences generates a minimal Ad 
chromosome which carries the Ad inverted terminal repeats and packaging signal 
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flanking the AAV vector genome. This chromosome cannot direct the synthesis of Ad 
proteins, but can be packaged into Ad vector particles. The remaining unrecombined 
Ad chromosomes provide the Ad structural proteins in trans, and both the full-length 
and minimal genomes are packaged into particles. The particles carrying the minimal 
Ad/AAV hybrid vector are then isolated by CsCl centrifugation. 

These particles have the capsid structure of adenovirus, and infect cells using 
the efficient fiber- and penton base-mediated pathway used by Ad. Following infection, 
the hybrid genome is able to integrate into the cell's chromosomes by virtue of its AAV 
sequences. In this example, the AAV vector genome is inserted into the El region of 
a fiber-deleted vector, and the resulting vector is grown in packaging lines expressing 
either the Ad5 or Ad37 fiber proteins. The particles recovered therefore have the 
tropisms expected from the respective fiber proteins combined with the ability to 
integrate their AAV genome into target cells. Such pseudotyping should be possible 
with any of a number of modified fiber proteins, as for the fiber-deleted vectors already 
described by us. 

The Ad vector is constructed in a manner analogous to that described for 
Ad5.Pgal.AF, by recombination between pAd.AAVl (Lieber etal 1 Virol 75:9314- 
9324, 1999) and pDV44 (as described earlier in the specification.) pAd.AAVl carries 
an ML V promoter-driven secreted alkaline phosphatase gene (SEAP) as a reporter, and 
an S V40-driven neomycin phosphotransferase (neo) gene to allow the selection of cells 
stable transduced by the AAV cassette. The resulting vector (Ad. AAV1 AF) has the 
AAV vector cassette of Ad.AAVl inserted into the El region of a genome with the 
fiber deletion of Ad5 . pgal.AF. Growth of Ad. AAV1 .AF in 633 cells results in particles 
carrying the AAV genome and the Ad5 fiber, and which have the tropism associated 
with Ad5 . Growth of Ad.AAVl . AF in 705 cells produces particles bearing the Ad37 
fiber and therefore having its associated different tropism. 

Tropism is evaluated by infecting Chang C cells (which express the Ad37 
receptor) and A549 cells which do not express this protein but do express the Ad5 
receptor (CAR). The extent of infection is monitored by assaying alkaline phosphatase 
expression, and the fraction of cells stable transduced is assayed by selection with 
neomycin. By using purified recombinant Ad5 or Ad37 fiber proteins as competitors 
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during infection, the usage of the expected receptors by the pseudotyped particles is 
evaluated. 

Example 14 

Use of the Fiber Expression System to Retarget 
( r Pseudotype f ) Helper-dependent Ad Vectors 

Gutted Ad vectors are those from which most or all viral genes have been 
deleted. They are grown by co-infection of the producing cells with a "helper" virus 
(using an El -deleted Ad vector). The helper virus /ra^-complements the missing Ad 
functions, including production of the viral structural proteins needed for particle 
assembly. In one embodiment of this invention, the helper virus is a fiber-deleted Ad 
(such as that described in Von Seggern et al, J. Virol 73:1601-1608 (1999)). The 
vector is prepared in a fiber expressing cell line such as has been previously described 
by Von Seggern et al, J. Gen. Virol 79:1461-1468 (1998), Von Seggern et al, J. 
Virol 74:354-362 (2000). All the necessary Ad proteins except fiber are provided by 
the fiber-deleted helper virus, and the particles are equipped with the particular fiber 
expressed by the host cells. A concern with gutted vectors has been contamination of 
a vector preparation with residual helper virus. As the helper virus in one aspect of this 
invention is deleted for both El and for fiber, it is more replication defective and 
therefore safer than those currently used. 

A helper adenovirus vector genome and a gutless adenoviral vector genome 
are delivered to the packaging cells of the invention. The cells are maintained under 
standard cell maintenance or growth conditions, whereby the helper vector genome and 
the packaging cell together provide the complementing proteins for the packaging of 
the adenoviral vector particle. Such gutless adenoviral vector particles are recovered 
by standard techniques. The helper vector genome may be delivered in the form of a 
plasmid or similar construct by standard transfection techniques, or it may be delivered 
through infection by a viral particle containing the genome. Such viral particle is 
commonly called a helper virus. Similarly, the gutless adenoviral vector genome may 
be delivered to the cell by transfection or viral infection. 
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The helper virus genome is preferably the fiberless adenovirus vector genome 
as disclosed herein. Preferably, such genome also lacks the genes encoding the 
adenovirus E1A and E1B proteins. More preferably, the genome further lacks the 
adenovirus genes encoding the adenovirus E3 proteins. Alternatively, the genes 
5 encoding such proteins may be present but mutated so that they do not encode 

functional El A, E1B and E3 proteins. Furthermore, such vector genome may not 
encode other functional early proteins, such as E2A, E2B, and E4 proteins. 
Alternatively, the genes encoding such other early proteins may be present but mutated 
so that they do not encode functional proteins. 
10 The helper virus genome is used in conjunction with the packaging cell of the 

f invention. As disclosed elsewhere herein, the packaging cell also provides proteins 

,= necessary for the complementation of the gutless vector so that an adenovirus particle 

jl containing the gutless vector genome may be produced. Thus, the packaging cell can 

J } provide wild-type or modified fiber protein as described herein. Alternatively, the cell 

yl5 could package a fiberless particle which could be used by itself or to which 

*-= exogenously provided fiber could be added as described elsewhere herein. 

; :: In producing the gutless vectors, the helper virus genome is also packaged, 

z thereby producing helper virus. In order the minimize the amount of helper virus 

Z produced and maximize the amount of gutless vector particles produced, it is preferable 

20 to delete or otherwise modify the packaging sequence in the helper virus genome, so 

that packaging of the genome is prevented or limited. Since the gutless vector genome 
will have a packaging sequence, it will be preferentially packaged. 

One way to do this is to mutate the packaging sequence by deleting one or 
more of the nucleotides comprising the sequence or otherwise mutating the sequence 
25 to inactivate or hamper the packaging function. An alternative approach is to engineer 

the helper genome so that recombinase target sites flank the packaging sequence and 
to provide a recombinase in the packaging cell The action of recombinase on such 
sites results in the removal of the packaging sequence from the helper virus genome. 
Preferably, the recombinase is provided by a nucleotide sequence in the packaging cell 
30 that encodes the recombinase. Most preferably, such sequence is stably integrated into 

the genome of the packaging cell. Various kinds of recombinase are known by those 
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skilled in the art. The preferred recombinase is Cre recombinase, which operates on 
so-called lox sites, which are engineered on either side of the packaging sequence as 
discussed above. Further information about the use of Cre-loxP recombination is found 
in U.S. Pat. No. 5,919,676 and Morsy and Caskey, Molecular Medicine Today, Jan. 
1999, pgs. 18-24, both incorporated herein by reference. 

This example demonstrates how the fiber-expressing packaging lines can be 
used to generate pseudotyped particles of helper-dependent or 'gutless' vectors with 
altered tropisms. As the gutless vectors lack many or all Ad genes, they must be grown 
as mixed cultures in the presence of a helper virus which can provide the missing 
functions. To date, such helper viruses have provided all Ad functions except El, and 
El is complemented by growth in 293 cells or the equivalent. The resulting virus 
particles are harvested, and the helper virus is typically removed by CsCl gradient 
centrifugation (the vector chromosome is generally shorter than the helper 
chromosome, resulting in a difference in buoyant density between the two particles). 

An example of a gutless vector is pAdARSVDys (Haecker et al, Human Gene 
Therapy 7; 1907-1914 (1996)). This plasmid contains a full-length human dystrophin 
cDNA driven by the RS V promoter and flanked by Ad inverted terminal repeats and 
packaging signals. 293 cells are infected with a first-generation Ad which serves as a 
helper virus, and then transfected with purified pAdARSVDys DNA. Both the helper 
Ad genome and the pAdARSVDys DNA are replicated as Ad chromosomes, and 
packaged into particles using the viral proteins produced by the helper virus. Particles 
are isolated and the pAdARSVDys- containing particles separated from the helper by 
virtue of their smaller genome size and therefore different density on CsCl gradients. 

To generate pseudotyped particles containing the pAdARSVDys genome, the 
vector is grown in either 633 or 705 cells and Ad5.pgaLAF is used as a helper virus. 
As in the published method, both the Ad5 . pgal.AF and AdARSVDys genomes replicate 
and are packaged into particles. The Ad5 . pgal.AF helper provides all the essential Ad 
proteins except fiber, and the fiber protein is that produced by the cells (Ad5 fiber in 
633 cells and Ad37 fiber in the case of 705 cells). The particles containing 
AdARSVDys genomes are then isolated by centrifugation. 
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Tropism is evaluated by infecting Chang C cells (which express the Ad37 
receptor) and A549 cells which do not express this protein but do express the natural 
Ad5 receptor (CAR). The extent of infection is assessed by immunofluorescence 
staining of the infected cells with an anti-dystrophin antibody. By using purified 
recombinant Ad5 and Ad37 fiber proteins as competitors during infection, the usage 
of the expected receptors by the pseudotyped particles is evaluated. 

Example 15 
Targeting EBV-Infected B Cells 

There are a number of cell types, such as EBV-transformed B-lymphocytes, 
that are involved in human disease which are not transducible using standard Ad 
vectors. To address this problem 'pseudotyped' Ad5 Pgal.AF particles containing 
either the wildtype Ad5 fiber protein or a chimeric fiber with the receptor-binding knob 
domain of the adenovirus type 3 (Ad3) fiber were generated. (Von Seggern et al, J. 
Virol January, 2000). The strategy used for targeting the B-cells should be broadly 
applicable for targeting gene delivery to other specific cell types. 

Cells and Viruses, THP- 1 , MRC-5 , FaDu, and A- 1 0 cells were purchased from 
the ATCC. 21 IB is a 293-derived cell line that expresses the wild-type Ad5 fiber 
protein (Von Seggern et al, J. Gen. Virol 79: 1461-1468 (1998)). El- 2a (Gorziglia 
etal,J. Virol 70:4173-4178(1996)) is an A549-derived cell line which complements 
adenoviral El and E2a functions. The JR, TO, and TL LCL lines were established as 
described (Huang et al, Proa Natl Acad Scl 94:8156-8161 (1997) by EBV 
infection of lymphocytes from three normal donors. THP-1 and all LCL lines were 
maintained in RPMI 1640 medium (Gibco) + 10% fetal calf serum (FCS) (Hyclone). 
21 IB, MRC-5, and A-10 cells were grown in DMEM + 10% FCS. El-2a and its 
derivatives were grown in Richter's modified medium (BioWhitaker) + 10% FCS. 
Peripheral blood mononuclear cells were isolated from normal human blood (General 
Clinical Research Center, Scripps Clinic) by sedimentation on Ficoll-Paque (Pharmacia) 
per the manufacturer's instructions. Wild type Ad2 and Ad3 were purchased from the 
ATCC. Construction of Ad5.pgal.wt and Ad5. Pgal.AF (Von Seggern et al, J. Virol 



- 118- 

73:1601-1608 (1999)) has been previously described. AvlLacZ4 (Mitttereder etal, 
J. Virol 70:7498-7509 (1996)) is a first-generation Ad5 vector containing an RSV- 
driven p-galactosidase reporter gene. Av9LacZ4 (Stevenson etal., J. Virol 77:4782- 
4790 (1997)) is identical to AvlLacZ4 except that the fiber gene in the vector 
chromosome was replaced by a recombinant gene encoding a chimeric fiber protein 
with the receptor-binding domain of the Ad3 fiber (Stevenson et al, J. virol 69:2850- 
2857 (1995)). Accession numbers for the above are as follows. THP-1: TIB-202, 
MRC-5: CCL-171, FaDu: HTB-43, A-10: CRL-1476, Ad2: VR-846, Ad3: VR-3. 

DNA constructs. The complete Ad5 tripartite leader contained in pDV67 and 
pDV69 was constructed by assembly of PCR fragments. pDV55 was constructed 
similar to Example 5. This plasmid contains a 1.2 kb Bam Hl/Bgl II fragment 
consisting of the first TPL exon, the natural first intron, and the fused second and third 
TPL exons. Finally, pDV60 was constructed by inserting this TPL cassette into the 
Bam HI site upstream of the Ad5 fiber gene in pcDNA3/Fiber (Von Seggern et al, J. 
Gen. Virol 79:1461-1468(1998)). pDV61 and pDV67 were then constructed similar 
to example 6. 

The chimeric Ad3/Ad5 fiber gene was amplified from pGEM5T3H (Stevenson 
etal, J. Virol. 69:2850-2857 (1995) using the primers 5' ATG GGA TCC AAG ATG 
AAG CGC GCA AGA CCG 3 ' (SEQ ID NO: 75) and 5' CAC TAT AGC GGC CGC 
ATT CTC AGT CAT CTT 3' (SEQ ID NO: 76) , and cloned to the Bam HI and Not 
I sites of pcDNA3. l/Zeo(+) via novel Bam HI and Not I sites (bold) engineered into 
the primers to create pD V68 . Finally, the complete TPL fragment described above was 
then added to the unique Bam HI site of this plasmid to create pDV69. 

Construction of Stable Cell Lines. El -2a cells were electroporated as 
previously described (Von Seggern et al, J. Gen. Virol 79: 1461-1468 (1998)) with 
pDV61, pDV67, or pDV69, and stable lines were selected with 600 jig/ml Zeocin 
(Invitrogen). Candidate clones were evaluated by immunofluorescence (Von Seggern 
et al, J. Gen. Virol 79:1461-1468 (1998)) using a polyclonal antibody generated 
against the Ad2 fiber (Wickham etal, Cell 75:309-3 1 9 (1993). Those lines expressing 
the highest level of nuclear fiber expression were further characterized. Line 601 and 
633 were produced by transfection of pDV61 and pDV67, respectively, and therefore 
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express the wildtype Ad5 fiber. Line 644 contains pDV69 and expresses the chimeric 
5T3H fiber. 

Virus Growth and Analysis. Adenovirus stocks were prepared in the indicated 
cell lines, and plaque-titered on 633 cells essentially as described (Von Seggern et al, 
J. Virol 73:1601-1608(1999)). El -2a cells (Gorziglia et al, J. Virol 70A\11>-A\1% 
(1996). and their derivatives contain a dexamethasone-inducible construct for 
complementation of Ela. 601, 633, or 644 cells were therefore treated with 0.3 
dexamethasone for 24 hours prior to infection, and 0.5 [iM dexamethasone was 
included in the overlay for plaque assays. Protein concentration of viral preparations 
was determined using the BioRad Protein Assay (BioRad) with purified bovine serum 
albumin as a standard. Particle number was calculated using the formula 1 ^ig protein 
- 4 x 10 9 viral particles. Western blotting was performed as described (Von Seggern 
etal,l Gen. Virol 79:1461-1468 (1998)) using polyclonal rabbit antibodies raised 
against either the Ad2 (Wickham et al, Cell 73:309-319 (1993) or Ad3 fibers 
(Stevenson et al, J. Virol 77:4782-4790 (1997). 

Determination of infection and binding to receptor was performed using 
methods known to those of skill in the art. 2 x 1 0 5 cells in a total volume of 200 \A were 
incubated with the indicated Ad preparation for three hours at 37 °C. Cells were then 
washed twice with fresh medium, and returned to 37 °C. Two days later, cells were 
fixed and stained with X-gal and counted by light microscopy as described (Von 
Seggern e/a/., J. Virol 73:1601-1608(1999)). For competition assays, cells were pre- 
incubated on ice for one hour with either recombinant Ad3 fiber (10 ng/ml) purified 
from baculovirus or with a crude baculovirus lysate (100 [ig/ml) containing the 
recombinant Ad2 fiber protein (Wickham et al, Cell 73:309-3 19 (1997)). Expression 
of a v integrins on cell surfaces was assayed by FACS assay using monoclonal 
antibodies (the gift of David Cheresh, TSRI) against either cc v P 3 (LM609) or a v P 5 
(P1F6) as previously described (Huang et al, Proa Natl Acad. Set USA 94:8156- 
8161 (1997)). For virus binding assays, CsCl-purified Ad2 or Ad3 was labeled with 
125 I using Iodogen tubes (Pierce). Free iodine was removed by filtration with a PD- 1 0 
Sephadex column (Pharmacia). Cells (1 x 10 6 cells in a volume of 200 ^1 either with 
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or without a 100-fold excess of unlabeled virus) were rocked at 4 °C for two hours 
with 1 x 10 6 cpm of the labeled virus, washed three times with PBS and counted. 

Altered in vitro tropism and infection of B lymphoid cell lines. 
Experiments with genetically modified viruses showed that a number of different cell 
types are more readily infected through interaction with the Ad3 receptor than by the 
CAR-dependent pathway used by Ad5 (Stevenson et aL, J, Virol. 77:4782-4790 
(1997)). In order to further evaluate the pseudotyping system, the ability of 
Ad5 . Pgal.AF carrying either the Ad5 or chimeric 5T3H fibers to infect several cell lines 
was assayed: FaDu (a head and neck tumor line), THP- 1 monocytic cells, and MRC-5 
fibroblasts were assayed. Consistent with the previous studies (Stevenson et aL, J. 
Virol 77:4782-4790 (1997)), use of the chimeric Ad5/Ad3 fiber protein increased 
infection of all of these lines at equal particle/cell ratios. In contrast, the rat smooth 
muscle cell line A- 10 was infected somewhat more readily by Ad5- than by Ad3- 
pseudotyped particles. 

Gene delivery to EBV-infected B cells could allow the development of 
therapies for a variety of lymphoproliferative disorders. For example, ex vivo purging 
of donor marrow to eliminate infected cells could reduce the risk of EBV-associated 
lymphoproliferative disease, and EBV-induced malignancies such as AIDS-associated 
lymphoma are also potential targets. However, neither B cells nor EBV-transformed 
lymphoblastoid cell lines (LCLs) are efficiently infected by Ad5-based vectors. As the 
tropism of Ad3-pseudotyped particles appeared to be somewhat broader, it was asked 
whether EBV-infected LCLs could be infected using this system. The ability of Ad3- 
pseudotyped particles to infect LCLs generated by EBV infection of lymphocytes from 
three different normal human donors was tested. In agreement with previous reports, 
there was little or no infection of these by particles carrying the Ad5 fiber. In contrast, 
virus particles equipped with the chimeric fiber protein were able to efficiently infect 
all of these lines. At equal particle/cell ratios, all LCLs examined were at least 10-fold 
more infectible using the Ad3 receptor. 

Further studies were performed to correlate the efficiency of infection with the 
level of attachment and internalization receptors expressed by the cells. The three LCL 
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lines tested all bound very low levels of radiolabeled Ad2 particles, indicating that they 
expressed little or no CAR. In contrast, all three were able to specifically bind labeled 
Ad3 particles. This result suggested that fiber receptor distribution was largely 
responsible for the increased infection of these cells by Ad3-pseudotyped particles. 

Selective gene delivery to EB V-infected cells. The results above suggested that 
the minority of EBV-infected B cells present in donor marrow or peripheral blood 
would be preferentially infected by vectors using the Ad3 receptor. To test this 
hypothesis, a mixing experiment with normal uninfected peripheral blood mononuclear 
cells (PBMCs) and EBV-infected cells was performed. JR-LCL cells were mixed at 
varying ratios with PBMCs isolated from a normal human donor, and the mixture was 
then infected with Ad5.pgal.AF particles containing the 5T3H fiber protein. No 
infection of normal PBMCs alone was detected. Moreover, the percent of total cells 
infected increased with the fraction of JR cells added. These experiments indicate that 
EBV-infected cells can be selectively infected in vitro by relatively short (3 hours) 
exposure to a retargeted Ad vector. 

Example 16 

Production of Adenovirus Vectors by Addition of Exogenous Fiber 

The production of fiberless viruses by growth in a complementing cell line may 
result in a preparation that also contains contaminating fiber genome resulting from 
recombination in the complementing cell lines. This disadvantage is eliminated by 
addition of exogenous fiber to a fiberless adenovirus vector. 

Production of fiberless virus by standard methods may include a two-step 
preparation protocol. This has been described in the earlier examples and is briefly 
described here again as follows: 

Step I - amplification of fiber containing fiberless virus (Ad5/F'/F + or 
AdS.pgalAF - fiberless, but there is fiber on the surface, not encoded in genome) on 
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21 IB cell line (which stably expresses fiber), followed by CsCl-purification and 
characterization. 

Step II - preparation of virus particles lacking fiber (Ad5F~) by infection of S.8 
cell line with Ad5/F7F + , followed by CsCl purification and characterization. This 
5 produces a large stock of particles which do not contain fiber. 

Step 1 is necessary because the infection efficiency of fiberless virus is 
extremely low, e.g. the dose of 20,000 particles/cell of Ad5/(3galF" gives only 10% 
infected cells. 

Contrary to the above, the production of fiberless virus by addition of 

10 exogenous fiber involves only a one-step protocol The fiberless virus is amplified 

using the S.8 cell line with addition of exogenous fiber into infection media. The 
amount of exogenous fiber necessary for production is very low, no more than 75ng 
of purified fiber required per roller bottle. If desired the process may be followed by 
CsCl purification. As mentioned above, one advantage to this protocol is that it should 

15 provide no chance for recombination of adenovector during preparation. 

A 1 0 roller bottle (RB) preparation of fiberless virus was made using the above 
two-step procedure. The yield of adenovector was 6.6xl0 12 particles - total Ad/PgalF 
(two step procedure.) A 1 RB preparation of fiberless adenovector was also made from 
the same initial material using a one-step procedure with exogenous fiber. The total 

20 yield was 2.5xl0 n particles - AdSPgalF (one step procedure). 

DNA was isolated from both preparations and a PCR assay for fiber 
contamination was performed. (Figure 28). The PCR assay was developed for 
detection of very low amounts of fiber contamination, as low as 10" 18 g. PCR assay 
showed much lower contamination for the preparation which was done by adding 

25 exogenous fiber (10" 15 g one-step procedure) vs. 10" 8 two-step procedure). Therefore, 

less contamination was obtained by simpler one-step approach. 

Experiments were done using soluble purified fiber which does not have His-taq 
on the end (AdSFiber = 5F) and with His-taq on the end (Ad5Fiber His = 5FHis). 
These experiments showed that addition of Ad5Fiber can dramatically increase 

30 transduction efficiency of fiberless adenovector by simply adding it exogenously to a 
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fiberless vector. The presence of the His tag on the Ad5FiberHis doesn't have any 
effect. 

The results of these experiments suggest that the fiber is self-assembling with 
the fiberless vector. This self-assembled virus can then infect cell through the normal 
5 entry pathway. (Figure 29) Also, an experiment was done using conditioned media 

from 633 cell line, which can stably express fiber. A Western blot analysis for 633 
condition media, showed that soluble fiber was present in the media during the period 
of cultivation of this cell line. Presence of soluble fiber in the media gives the 
possibility to increase transduction efficiency of fiberless adenovector on the HDF cell 

10 line. (Figure 30) Because the HDF cell line doesn't have a CAR-receptor, it is 

especially difficult to transduce this particular cell line, not only with fiberless vector, 
but also with regular fiber containing adenovector. Different amounts of 633 
conditioned media (250|il, 500^1 or 1000|nl) were added to infectious media during the 
incubation period with fiberless adenovector. 

1 5 This experiment also showed a role of soluble fiber in the process of cell entry. 

The conclusion is that by adding any fiber (wild-type, mutated, with ligand fusions) as 
long as on has the wild-type shaft (or region necessary to bind penton) one can retarget 
fiberless vector with any genome inside (gutless, oncolytic, expressing any transgene, 
etc.) to any cell type that your fiber is specific to. The advantage of this approach is 

20 that one does not have to make vectors with each new ligand. Just one fiberless vector 

need be made that can then be used to make different backbones by adding an 
exogenous "targetable" fiber off the shelf. 

The foregoing specification, including the specific embodiments and examples, 
is intended to be illustrative of the present invention and is not to be taken as limiting. 
25 Numerous other variations and modifications can be effected without departing from 

the true spirit and scope of the present invention. All publications, patents and patent 
applications cited herein are incorporated by reference in their entirety into the present 
disclosure. 
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What is claimed is: 



1 . An isolated nucleic acid molecule comprising an adenovirus tripartite 
leader (TPL) nucleotide, said TPL nucleotide sequence comprising (a) first and second 
different TPL exons or (b) first, second and third same or different TPL exons, said 
TPL exons selected from the group consisting of complete TPL exon 1, partial TPL 
exon 1, complete TPL exon 2 and complete TPL exon 3. 

2. The isolated nucleic acid molecule of claim 1, wherein said sequence is 
operatively linked to an intron containing an RNA processing signal. 

3. The isolated nucleic acid molecule of claim 1 wherein said TPL 
nucleotide sequence consists essentially of complete TPL exon 1 operatively linked to 
complete TPL exon 2 operatively linked to complete TPL exon 3. 

4. The isolated nucleic acid molecule of claim 1 wherein said intron is 
native adenovirus intron 1 . 

5. The isolated nucleic acid molecule of claim 1 wherein said TPL 
nucleotide sequence is shown in SEQ ID NO: 32. 

6. The isolated nucleic acid molecule of claim 1 further comprising a 
promoter and a nucleic acid sequence which encodes an adenoviral structural protein, 
operatively linked to said promoter and said TPL sequence. 

7. The isolated nucleic acid molecule of claim 6 wherein said adenoviral 
structural protein is a fiber protein or a chimeric protein which includes an adenovirus 
fiber protein tail domain. 
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8. The isolated nucleic acid molecule of claim 7 wherein said chimeric 
protein comprises an Ad3 head domain and an Ad5 tail domain or an Ad5 head domain 
and an Ad3 tail domain. 

9. The isolated nucleic acid molecule of claim 7 wherein said molecule is 
contained in a plasmid selected from the group consisting of plasmids pCLF, pDV60, 
pDV67, pDV69 P pDV80 and pDV90. 

1 0. The isolated nucleic acid molecule of claim 9 wherein said molecule has 
a nucleotide sequence selected from the group consisting of sequences shown in SEQ 
ID NO: 8, SEQ ID NO; 43, SEQ ID NO: 44, SEQ ID NO: 47, SEQ ID NO: 64 and 
SEQ ID NO: 65.. 

11. An adenovirus vector complementing plasmid comprising an isolated 
nucleic acid molecule according to claim 1 . 

12. An adenovirus vector packaging cell line comprising a stably integrated 
nucleic acid molecule of claim 1, an operatively-linked promoter and a nucleic acid 
sequence which encodes an adenovirus structural protein, wherein said TPL sequence 
consists essentially of a first TPL exon operatively linked to a complete second TPL 
exon operatively linked to a complete third TPL exon. 

13. The cell line of claim 12 wherein said first TPL exon is a complete or 
partial first TPL exon. 

14. The cell line of claim 1 3 wherein said TPL molecule comprises complete 
TPL exon 1 having the nucleotide sequence of SEQ ED NO: 32 or partial TPL exon 1 
having the nucleotide of SEQ ID NO: 26. 



15. The cell line of claim 12 wherein said promoter is an inducible 
promoter. 
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16. The cell line of claim 12 wherein said adenovirus structural protein is 
adenovirus fiber protein or a chimeric protein which includes an adenovirus fiber 
protein tail domain. 

17. The cell line of claim 12 wherein said chimeric protein comprises an 
5 Ad3 head domain and an Ad5 tail domain or an Ad5 head domain and an Ad3 tail 

domain. 

1 8 . The cell line of claim 1 2 wherein said nucleic acid molecule is selected 
from the group consisting of plasmids pDV60, pDV67 ? pDV69, pDV80 and pDV90. 

: 19. The cell line of claim 18 wherein said nucleic acid molecule has a 

j 1 0 nucleotide sequence from the group consisting of sequences shown in SEQ ID NO : 43 , 

SEQ ID NO: 44 and SEQ ID NO: 47. 

I 20. The cell line of claim 12 wherein said cell line is an epithelial cell line. 

I 21. The cell line of claim 20 wherein said cell line supports the production 

of a recombinant adenovirus vector genome by complementation of a deficient viral 
1 5 gene in said vector genome. 

22. The cell line of claim 21 wherein said cell line further produces an 
adenovirus protein and thereby complements a deficient adenovirus gene in said vector 
genome, and wherein said cell line complements an adenovirus early protein gene and 
a fiber gene. 



20 



23. The cell line of claim 22 wherein the deletion of said deficient 
adenovirus gene is complemented by the expression of said gene under the control of 
an inducible promoter. 
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24. A recombinant adenovirus particle comprising a recombinant adenovirus 
vector genome wherein said genome does not encode or does not express sufficient 
adenovirus fiber protein to support packaging of a fiber-containing adenovirus particle 
without complementation of said fiber gene. 

25. The recombinant adenovirus particle of claim 24 wherein said 
adenovirus vector genome does not encode one or more functional proteins selected 
from the group consisting of El A, E1B, E2A, E2B, E3 and E4 protein. 

26. The particle of claim 24 wherein said adenovirus vector genome is 
Ad5.Bgal.AF. 

27. The particle of claim 24 wherein said adenovirus vector genome is 
contained in the adenovirus particle deposited under ATCC accession # VR2636 and 
corresponding to Ad5.Bgal.AF. 

28. The particle of claim 24 wherein said particle lacks fiber protein or 
contains a modified fiber protein. 

29. The particle of claim 24 wherein said particle comprises an adenovirus 
fiber protein or a chimeric protein having an adenovirus fiber protein tail domain, said 
chimeric protein comprising an Ad3 head domain and an Ad5 tail domain or an Ad5 
head domain and an Ad3 tail domain. 

30. The particle of claim 24 wherein said exogenous protein is a therapeutic 
gene product. 

31. A helper-independent fiberless recombinant adenovirus vector genome 
comprising genes which: 

(a) encodes all adenovirus structural gene products but do not express 
sufficient adenovirus fiber protein to package a fiber-containing adenovirus particle 
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without complementation of said fiber gene or said genome lacks at least the fibre gene, 
and 

(b) encodes an exogenous protein. 

32. The adenovirus vector genome of claim 31 wherein said adenovirus 
vector genome does not encode one or more functional proteins selected from the 
group consisting of El A, E1B ; E2A, E2B, E3 and E4 protein. 

33. The adenovirus vector genome of claim 31 wherein said adenovirus 
vector genome is Ad5.BgaLAF. 

34. The adenovirus vector genome of claim 33 wherein said adenovirus 
vector genome has a nucleotide sequence shown in SEQ ID NO:27 and corresponds 
to Ad5.Bgal.AF. 

35. The adenovirus vector genome of claim 31 wherein said adenovirus 
vector genome is contained in the adenovirus particle deposited under ATCC accession 
VR-2636 corresponding to Ad5.Bgal.AF. 

36. The adenovirus vector genome of claim 31 wherein said exogenous 
protein is a therapeutic gene product. 

37. An isolated nucleic acid that comprises the adenovirus vector genome 
of claim 3 1 . 

38. A method for producing an adenovirus vector particle containing a 
helper-independent fiberless recombinant adenovirus vector genome, said method 
comprising providing a packaging cell line which complements replication and 
packaging of said genome and a helper-independent fiberless recombinant adenovirus 
vector genome which is deficient in expressing sufficient functional fiber protein to 
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support assembly of fiber-containing particles and harvesting said particle produced by 
said cell line. 

39. The method of claim 3 8 wherein said packaging cell line complements 
adenovirus fiber protein. 

40. The method of claim 38 wherein said adenovirus vector genome 
comprises genes that: 

(a) express all adenovirus structural gene products but do not express 
sufficient adenovirus fiber protein to package a fiber- containing adenovirus particle 
without complementation of said fiber gene or said genome lacks at least the fibre gene, 
and 

(b) express an exogenous protein. 

41. The method of claim 38 wherein said packaging cell line comprises a 
stably integrated first nucleic acid molecule alternatively operatively linked to a 
promoter, and said first nucleic acid is operatively linked to a second nucleic acid 
molecule encoding an adenovirus structural protein, wherein said first nucleic acid 
molecule comprises an adenovirus tripartite leader (TPL) nucleotide sequence 
operatively linked to an intron containing an RNA processing signal, said TPL 
nucleotide sequence comprising (a) first and second different TPL exons or (b) first, 
second and third different TPL exons, said TPL exons selected from the group 
consisting of complete TPL exon 1, partial TPL exon 1, complete TPL exon 2 and 
complete TPL exon 3. 

42. The method of claim 38 wherein said helper-independent fiberless 
recombinant adenovirus vector genome is introduced by infecting said cell line with a 
virus particle containing said genome. 



43. The method of claim 42 wherein said particle is a particle comprising 
a helper-independent recombinant adenovirus vector genome comprising genes that: 
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(a) encode all adenovirus structural gene products but do not express 
sufficient adenovirus fiber protein to support packaging of a fiber-containing 
adenovirus particle without complementation of said fiber gene or said genome lacks 
at least the fibre gene, and 

(b) encode an exogenous protein, 

wherein said particle comprises an adenovirus fiber protein or a chimeric 
protein that includes an adenovirus fiber protein tail domain. 

44. The method of claim 38 wherein said helper-independent fiberless 
recombinant adenovirus vector genome is introduced into said cell line by transfecting 
said cell line with said helper-independent fiberless recombinant adenovirus vector 
genome 

45. The method of claim 44 wherein said adenovirus vector genome 
comprises genes which: 

(a) encode all adenovirus structural gene products but do not express 
sufficient adenovirus fiber protein to package a fiber-containing adenovirus particle 
without complementation of said fiber gene or said genome lacks at least the fibre gene, 
and 

(b) encode an exogenous protein. 

46. The method of claim 38 wherein said packaging cell line is transfected 
with a nucleic acid molecule encoding adenovirus fiber protein. 

47. The method of claim 46 wherein said nucleic acid molecule is a nucleic 
acid molecule comprising an adenovirus tripartite leader (TPL) nucleotide sequence, 
said TPL nucleotide sequence comprising (a) first and second different TPL exons or 
(b) first, second and third different TPL exons, said TPL exons selected from the group 
consisting of complete TPL exon 1, partial TPL exon 1, complete TPL exon 2 and 
complete TPL exon 3 and said molecule further comprises a sequence encoding 
adenovirus fiber protein. 
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48. The method of claim 39 wherein said adenovirus fiber protein is a 
modified fiber protein. 

49. The method of claim 38 further comprising the step of coating said 
particle with adenovirus fiber protein. 

5 50. A method for delivery of an exogenous gene to a target cell comprising 

contacting said cell with an amount of a recombinant adenovirus particle of claim 24 
sufficient to infect said cell 

51. The method of claim 50 wherein said exogenous gene encodes a 
therapeutic gene product. 

1° 52. The method of claim 5 1 wherein said recombinant adenovirus particle 

contains a modified fiber protein which binds a preselected target cell and directs 
delivery of the particle to said target cell 

53. The method of claim 50 wherein said recombinant adenovirus particle 
comprises a helper-independent fiberless recombinant adenovirus vector genome 

1 5 comprising genes that: 

(a) encode all adenovirus structural gene products but do not express 
sufficient adenovirus fiber protein to package a fiber-containing adenovirus particle 
without complementation of said fiber gene or said genome lacks at least the fibre gene, 
and 

20 (b) encode an exogenous protein. 

54. The method of claim 50 wherein said modified fiber protein has an 
amino terminal head domain which binds to a v integrins and thereby targets cells with 
a v integrin receptors. 



55. 



The method of claim 50 wherein said contacting is conducted in vitro. 
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56. The method of claim 55 wherein said contacting is conducted on cells 
of a tissue which are first removed from the body of a patient, and the cells are 
subsequently returned to said patient. 

57. The method of claim 50 wherein said contacting is conducted in vivo 
5 by administering said recombinant adenovirus particle to a tissue of said patient. 

58. The method of claim 57 wherein said administering is intravenously, 
intraperitoneal^, by aerosol, topically or by injection. 

59. A method for pseudotyping recombinant viral vectors comprising 
complementing a missing fiber gene of a helper-independent or helper dependent 

10 fiberless recombinant adenovirus vector genome by expressing in packaging cells a 

fiber gene from a different adenoviral serotype than said recombinant adenovirus 
vector, thereby pseudotyping said vector. 

60. A method for specifically targeting an adenovirus vector to a cell of 
choice comprising introducing a helper-independent or helper-dependent fiberless 

1 5 recombinant adenovirus vector genome into a packaging cell line for producing a fiber 

gene-deleted adenovirus vector, wherein a gene for a missing fiber protein is 
complemented with a gene for a desired modification for targeting the vector to a cell 
of choice. 



61. A method for producing a modified adenovirus comprising providing in 
20 vitro an exogenous fiber protein to a fiberless adenovirus. 

62. The method of claim 61, wherein said fiber is provided by adding fiber 
protein in a suitable buffer to a fiberless virus preparation, thereby producing a 
modified adenovirus. 



- 133- 

63. The method of claim 61, wherein a helper-independent or helper- 
dependent fiberless recombinant adenovirus vector genome is introduced into a 
packaging cell line to produce a fiberless adenovirus to which exogenous fiber protein 
will be provided. 

64. A method for delivering a heterologous gene to EBV-infected B cells 
comprising infecting said B cells with a pseudotyped Ad5Pgal.AF particle or other 
fiber-deleted adenovirus particle, said particle having a chimeric fiber including the 
receptor-binding knob domain of the adenovirus type 3 fiber. 

65. The adenovirus particle of claim 24 wherein said adenovirus vector 
genome lacks a fibre gene or lacks a portion of the fibre gene sequence such that fibre 
protein is not expressed in sufficient quantities to support packaging. 

66. The recombinant adenovirus particle of claim 24 comprising a helper- 
independent recombinant adenovirus vector genome comprising genes that: 

(a) encode all adenovirus structural gene products but do not express 
sufficient adenovirus fiber protein to support packaging of a fiber-containing 
adenovirus particle without complementation of said fiber gene or said genome lacks 
at least the gene encoding fibre, and 

(b) optionally encodes an exogenous protein. 

67. The recombinant adenovirus particle of claim 24 wherein said particle 
further comprises a nucleic acid encoding an exogenous protein. 

68. A method for producing a modified adenovirus comprising providing 
a packaging cell line for producing a fiberless adenovirus helper- dependent fiberless 
recombinant adenovirus vector genome and a helper virus vector, wherein said cell line 
complements at least a deficient fiber protein gene, thereby producing the modified 
adenovirus. 
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69 The packaging cell line of claim 9 wherein said cell line is selected from 
the group consisting of 293, A549, W163, HeLa, Vero ? 21 1, 21 1 A and an epithelial 
cell line comprising the stably integrated nucleic acid molecule. 

70. The recombinant adenovirus particle of claim 21 wherein said 
exogenous protein is selected from a group consisting of a tumor-suppressor protein, 
a biologically active fragment thereof, a suicide protein and a biologically active 
fragment thereof. 

71. A composition for preparing a therapeutic vector, said composition 
comprising a plasmid comprising an adenovirus genome lacking a nucleotide sequence 
encoding a fiber protein or a genome that is incapable of expressing sufficient fiber to 
result in packaging. 

72. A method of delivering a heterologous gene to a human or any animal 
comprising providing heterologous gene to a target cell wherein said target cell is 
contacted in vivo or ex vivo with an amount of a recombinant adenovirus particle of 
claim 24 sufficient to infect said cell and thereby deliver the heterologous gene. 

73. The recombinant adenovirus vector particle of claim 24, wherein no 
fiber protein is expressed. 

74. The recombinant adenovirus vector genome of claim 31, wherein no 
fiber protein is expressed. 



20 



75. The recombinant adenovirus vector particle of claim 24, wherein said 
genome expresses insufficient fiber to allow incorporation of said protein into the 
particle such that the particle cannot use the fiber pathway for infection. 
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76. The recombinant adenovirus genome of claim 31, wherein said genome 
expresses insufficient fiber to allow incorporation of said protein into a particle such 
that the particle cannot use the fiber pathway for infection. 

77. A method for producing a gutless adenoviral vector particle comprising: 
a) delivering a helper adenovirus vector genome to an adenovirus vector 

packaging cell, wherein said helper adenovirus vector genome lacks any gene encoding 
adenovirus fiber protein or lacks the ability to encode sufficient adenovirus fiber protein 
to produce an adenoviral vector comprising fiber protein in the absence of 
complemetation by said packing cell and wherein said packaging cell comprises the 
nucleic acid molecule of claim 2 operably linked to a promoter and to an adenoviral 
fiber protein or to a chimeric protein that includes an adenovirus fiber protein tail 
domain; 

(b) delivering a gutless adenovirus vector genome to said packaging 

cell; and 

(c) recovering the gutless adenoviral vector particle produced by said 

cell. 



78. The method of claim 77, wherein said helper adenovirus vector genome 
is delivered by viral infection. 

79. The method of claim 78, wherein said gutless adenovirus vector genome 
is delivered by transfection. 

80. The method of claim 77, wherein said gutless adenovirus vector genome 
comprises an operable packaging sequence. 

8 1 . The method of claim 80, wherein said helper adenovirus vector genome 
has a mutation in its packaging sequence that renders said genome substantially 
incapable of being packaged as an adenoviral vector particle by said packaging cell 
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82. The method of claim 80, wherein said helper adenovirus vector genome 
comprises recombinase sites flanking its packaging sequence and said packaging cell 
further comprises a nucleotide sequence encoding a recombinase. 

83 . The method of claim 82, wherein said recombinase site is a lox site and 
said recombinase is Cre. 



84. A helper adenovirus particle comprising an adenovirus vector genome 
that does not encode or does not express sufficient adenovirus fiber protein to support 
packaging of a fiber-containing adenovirus particle without complementation of said 
fiber gene, wherein said genome has a mutation in its packaging sequence that renders 
said genome substantially incapable of being packaged. 

85. The helper adenovirus particle of claim 84, wherein said mutation 
comprises a deletion of at least one nucleotide in said packaging sequence. 

86. The helper adenovirus particle of claim 85, wherein said adenovirus 
vector genome does not encode functional proteins selected from the group consisting 
of E1A, E1B, E2A, E2B, E3, and E4 proteins. 

87. A helper adenovirus particle comprising an adenovirus vector genome 
with recombinase sites flanking its packaging sequence, wherein said vector genome 
does not encode or does not express sufficient adenovirus fiber protein to support 
packaging of a fiber-containing adenovirus particle without complementation of said 
fiber gene. 



88. The helper adenovirus particle of claim 87, wherein said adenovirus 
vector genome does not encode functional proteins selected from the group consisting 
of E1A, E1B, E2A, E2B, E3, and E4 proteins. 
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89 An adenovirus particle comprising a gutless adenoviral vector genome 
and a fiberless capsid. 

90. An adenovirus particle comprising a gutless adenoviral vector genome 
and a capsid comprising a modified fiber protein. 

91. A packaging cell for the production of a fiberless or fiber-modified 
gutless adenovirus particle comprising an adenovirus vector complementing plasmid 
and a nucleotide sequence encoding a recombinase, wherein said complementing 
plasmid comprises the nucleic acid molecule of claim 2 operably linked to a promoter 
and to a nucleotide sequence encoding an adenoviral fiber protein or a chimeric 
adenoviral fiber protein. 

92. The packaging cell of claim 91, wherein said complementing plasmid 
and said nucleotide sequence encoding a recombinase are stably integrated into the 
genome of said cell. 

93 . The packaging cell of claim 9 1 , further comprising a helper adenovirus 
vector genome. 



The packaging cell of claim 91, wherein said recombinase is Cre. 
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ABSTRACT 



The present invention relates to methods for gene therapy, especially to 
adenovirus-based gene therapy, and related cell lines and compositions. In particular, 
novel nucleic acid constructs and packaging cell lines are disclosed, for use in 
facilitating the development of high-capacity and targeted vectors. The invention also 
discloses a variety of high-capacity adenovirus vectors and related compositions and 
kits including the disclosed cell lines and vectors. Finally, the invention discloses 
methods of preparing and using the disclosed vectors, cell lines and kits. 
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SEQUENCE LISTING 



<110> VON SEGGERN, DANIEL 
NEMEROW, GLEN R. 
HALLENBECK, PAUL 
STEVENSON, SUSAN 
SKRIPCHENKO, YELENA 



<120> ADENOVIRUS VECTORS , PACKAGING CELL LINES, COMPOSITIONS, 
AND METHODS FOR PREPARATION AND USE 

<130> 1294.0010001 

<140> 
<141> 

<160> 76 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 1 

cggtacacag aattcaggag acacaactcc 30 



<210> 2 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 2 

gcctggatcc gggaagttac gtaacgtggg aaaac 35 



<210> 3 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: linker 

<400> 3 
cgcggatccg eg 



<210> 4 
<211> 8710 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: plasmid 



<400> 4 

cacctaaatt gtaagcgtta atattttgtt 
ctcatttttt aaccaatagg ccgaaatcgg 
cgagataggg ttgagtgttg ttccagtttg 
ctccaacgtc aaagggcgaa aaaccgtcta 
accctaatca agttttttgg ggtcgaggtg 
gagcccccga tttagagctt gacggggaaa 
gaaagcgaaa ggagcgggcg ctagggcgct 
caccacaccc gccgcgctta atgcgccgct 
gcgcaactgt tgggaagggc gatcggtgcg 
agggggatgt gctgcaaggc gattaagttg 
ttgtaaaacg acggccagtg aattgtaata 
gccccccctc gaggtcgacg gtatcgataa 
caagtgcata ctctatgtca ttttcatggg 
tatttgccac atcctcttac actttttcat 
ttatgtttca acgtgtttat ttttcaattg 
agtatagccc caccaccaca tagcttatac 
accctagtat tcaacctgcc acctccctcc 
cggctggcct taaaaagcat catatcatgg 
cacacggttt cctgtcgagc caaacgctca 
tcacttaagt tcatgtcgct gtccagctgc 
tgcttaacgg gcggcgaagg agaagtccac 
atcaggatag ggcggtggtg ctgcagcagc 
gtcctgcagg aatacaacat ggcagtggtc 
ataaggcgcc ttgtcctccg ggcacagcag 
taactgcagc acagcaccac aatattgttc 
aagctcatgg cggggaccac agaacccacg 
aagtggcgac ccctcataaa cacgctggac 
ttcaccacct cccggtacca tataaacctc 
ctaaaccagc tggccaaaac ctgcccgccg 
caatgacagt ggagagccca ggactcgtaa 
atgttggcac aacacaggca cacgtgcata 
gttagaacca tatcccaggg aacaacccat 
ggaagacctc gcacgtaact cacgttgtgc 
ggatgatcct ccagtatggt agcgcgggtt 
ctgtacggag tgcgccgaga caaccgagat 
acgccggacg tagtcatatt tcctgaagca 
gcgtctccgg tctcgccgct tagatcgctc 
caaagcatcc aggcgccccc tggcttcggg 
cctgataaca tccaccaccg cagaataagc 
cgagtcacac acgggaggag cgggaagagc 
aaagattatc caaaacctca aaatgaagat 
cgtggtcaaa ctctacagcc aaagaacaga 
cttccaaaag gcaaacggcc ctcacgtcca 
gaatctcctc tataaacatt ccagcacctt 
accttctcaa tatatctcta agcaaatccc 
gctccagagc gccctccacc ttcagcctca 
ttcctcacag acctgtataa gattcaaaag 
taggtccctt cgcagggcca gctgaacata 
cacttccccg ccaggaacct tgacaaaaga 
agctatgcta accagcgtag ccccgatgta 
gcaaggtgct gctcaaaaaa tcaggcaaag 
catgctcatg cagataaagg caggtaagct 
ttctctcaaa catgtctgcg ggtttctgca 
ttaaacatta gaagcctgtc ttacaacagg 
tacggccatg ccggcgtgac cgtaaaaaaa 
cagctcctcg gtcatgtccg gagtcataat 
catcggtcag tgctaaaaag cgaccgaaat 
gagacaacat tacagccccc ataggaggta 



aaaattcgcg ttaaattttt gttaaatcag 60 
caaaatccct tataaatcaa aagaatagac 12 0 
gaacaagagt ccactattaa agaacgtgga 18 0 
tcagggcgat ggcccactac gtgaaccatc 240 
ccgtaaagca ctaaatcgga accctaaagg 300 
gccggcgaac gtggcgagaa aggaagggaa 3 60 
ggcaagtgta gcggtcacgc tgcgcgtaac 420 
acagggcgcg tcccattcgc cattcaggct 48 0 
ggcctcttcg ctattacgcc agctggcgaa 54 0 
ggtaacgcca gggttttccc agtcacgacg 600 
cgactcacta tagggcgaat tgggtaccgg 660 
gcttgatatc gaattcagga gacacaactc 720 
actggtctgg ccacaactac attaatgaaa 780 
acattgccca agaataaaga atcgtttgtg 840 
cagaaaattt caagtcattt ttcattcagt 900 
agatcaccgt accttaatca aactcacaga 960 
caacacacag agtacacagt cctttctccc 1020 
gtaacagaca tattcttagg tgttatattc 1080 
tcagtgatat taataaactc cccgggcagc 1140 
tgagccacag gctgctgtcc aacttgcggt 1200 
gcctacatgg gggtagagtc ataatcgtgc 1260 
gcgcgaataa actgctgccg ccgccgctcc 1320 
tcctcagcga tgattcgcac cgcccgcagc 1380 
cgcaccctga tctcacttaa atcagcacag 1440 
aaaatcccac agtgcaaggc gctgtatcca 1500 
tggccatcat accacaagcg caggtagatt 1560 
ataaacatta cctcttttgg catgttgtaa 1620 
tgattaaaca tggcgccatc caccaccatc 1680 
gctatacact gcagggaacc gggactggaa 174 0 
ccatggatca tcatgctcgt catgatatca 1800 
cacttcctca ggattacaag ctcctcccgc 1860 
tcctgaatca gcgtaaatcc cacactgcag 1920 
attgtcaaag tgttacattc gggcagcagc 1980 
tctgtctcaa aaggaggtag acgatcccta 204 0 
cgtgttggtc gtagtgtcat gccaaatgga 2100 
aaaccaggtg cgggcgtgac aaacagatct 2160 
tgtgtagtag ttgtagtata tccactctct 2220 
ttctatgtaa actccttcat gcgccgctgc 2280 
cacacccagc caacctacac attcgttctg 2340 
tggaagaacc atgttttttt ttttattcca 2400 
ctattaagtg aacgcgctcc cctccggtgg 24 60 
taatggcatt tgtaagatgt tgcacaatgg 2520 
agtggacgta aaggctaaac ccttcagggt 2580 
caaccatgcc caaataattc tcatctcgcc 2640 
gaatattaag tccggccatt gtaaaaatct 2700 
agcagcgaat catgattgca aaaattcagg 27 60 
cggaacatta acaaaaatac cgcgatcccg 2820 
atcgtgcagg tctgcacgga ccagcgcggc 288 0 
acccacactg attatgacac gcatactcgg 2940 
agctttgttg catgggcggc gatataaaat 3000 
cctcgcgcaa aaaagaaagc acatcgtagt 3060 
ccggaaccac cacagaaaaa gacaccattt 3120 
taaacacaaa ataaaataac aaaaaaacat 3180 
aaaaacaacc cttataagca taagacggac 3240 
ctggtcaccg tgattaaaaa gcaccaccga 3300 
gtaagactcg gtaaacacat caggttgatt 3360 
agcccggggg aatacatacc cgcaggcgta 3420 
taacaaaatt aataggagag aaaaacacat 3480 



) 



aaacacctga aaaaccctcc tgcctaggca 
acagcgcttc acagcggcag cctaacagtc 
aaaaaacacc actcgacacg gcaccagctc 
gcagagcgag tatatatagg actaaaaaat 
ccagaaaacc gcacgcgaac ctacgcccag 
caaatcgtca cttccgtttt cccacgttac 
tctccgcaag aattgattgg ctccaattct 
ccggcttcca ttcaggtcga ggtggcccgg 
cagacaaggt atagggcggc gcctacaatc 
gcggcataaa tcgccgtgac gatcagcggt 
gcgagcgatc cttgaagctg tccctgatgg 
tgcaacgcgg gcatcccgat gccgccggaa 
cagcctcgcg tcgcgaacgc cagcaagacg 
ttcatccccg tggcccgttg ctcgcgtttg 
catgtcttta gttctatgat gacacaaacc 
aacacgcaga tgcagtcggg gcggcgcggt 
cgtgtggcct cgaacaccga gcgaccctgc 
ccgcagatcc cgggcaatga gatatgaaaa 
agtttctgat cgaaaagttc gacagcgtct 
aatctcgtgc tttcagcttc gatgtaggag 
gcgccgatgg tttctacaaa gatcgttatg 
cgattccgga agtgcttgac attggggaat 
gccgtgcaca gggtgtcacg ttgcaagacc 
agccggtcgc ggaggccatg gatgcgatcg 
tcggcccatt cggaccgcaa ggaatcggtc 
cgattgctga tccccatgtg tatcactggc 
ccgtcgcgca ggctctcgat gagctgatgc 
acctcgtgca cgcggatttc ggctccaaca 
cggtcattga ctggagcgag gcgatgttcg 
tcttctggag gccgtggttg gcttgtatgg 
atccggagct tgcaggatcg ccgcggctcc 
aactctatca gagcttggtt gacggcaatt 
gcgacgcaat cgtccgatcc ggagccggga 
gcgcggccgt ctggaccgat ggctgtgtag 
ccagcactcg tccgagggca aaggaatagg 
aggagacaat accggaagga acccgcgcta 
acgggtgttg ggtcgtttgt tcataaacgc 
gataccccac cgagacccca ttggggccaa 
caccccccaa gttcgggtga aggcccaggg 
gccatagcca ctggccccgt gggttaggga 
tcgtgggggt tattattttg ggcgttgcgt 
cagacccatg gtttttggat ggcctgggca 
ccgggcgtct gtggctgcca aacacccccg 
cgcccagtgc cgtcgaccgg tcatggctgc 
gccctgacgg gcttgtctgc tcccggcatc 
gagctgcatg tgtcagaggt tttcaccgtc 
taatcagcca taccacattt gtagaggttt 
cctgaacctg aaacataaaa tgaatgcaat 
taatggttac aaataaagca atagcatcac 
gcattctagt tgtggtttgt ccaaactcat 
gttctagagc ggccgccacc gcggtggagc 
atttcgagct tggcgtaatc atggtcatag 
acaattccac acaacatacg agccggaagc 
gtgagctaac tcacattaat tgcgttgcgc 
tcgtgccagc tgcattaatg aatcggccaa 
cgctcttccg cttcctcgct cactgactcg 
gtatcagctc actcaaaggc ggtaatacgg 
aagaacatgt gagcaaaagg ccagcaaaag 
gcgtttttcc ataggctccg cccccctgac 
aggtggcgaa acccgacagg actataaaga 
gtgcgctctc ctgttccgac cctgccgctt 
ggaagcgtgg cgctttctca tagctcacgc 
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aaatagcacc ctcccgctcc agaacaacat 354 0 
agccttacca gtaaaaaaga aaacctatta 3600 
aatcagtcac agtgtaaaaa agggccaagt 3660 
gacgtaacgg ttaaagtcca caaaaaacac 3720 
aaacgaaagc caaaaaaccc acaacttcct 3780 
gtaacttccc ggatccgcgg cattcacagt 3840 
tggagtggtg aatccgttag cgaggtgccg 3900 
ctccatgcac cgcgacgcaa cgcggggagg 3960 
catgccaacc cgttccatgt gctcgccgag 4020 
ccagtgatcg aagttaggct ggtaagagcc 4080 
tcgtcatcta cctgcctgga cagcatggcc 414 0 
gcgagaagaa tcataatggg gaaggccatc 4200 
tagcccagcg cgtcggccgc catgccctgc 4260 
ctggcggtgt ccccggaaga aatatatttg 4320 
ccgcccagcg tcttgtcatt ggcgaattcg 4 380 
cccaggtcca cttcgcatat taaggtgacg 4 440 
agcgacccgc ttaacagcgt caacagcgtg 4 500 
agcctgaact caccgcgacg tctgtcgaga 4 5 60 
ccgacctgat gcagctctcg gagggcgaag 4 620 
ggcgtggata tgtcctgcgg gtaaatagct 4 680 
tttatcggca ctttgcatcg gccgcgctcc 4740 
tcagcgagag cctgacctat tgcatctccc 4800 
tgcctgaaac cgaactgccc gctgttctgc 4860 
ctgcggccga tcttagccag acgagcgggt 4 920 
aatacactac atggcgtgat ttcatatgcg 4 980 
aaactgtgat ggacgacacc gtcagtgcgt 5040 
tttgggccga ggactgcccc gaagtccggc 5100 
atgtcctgac ggacaatggc cgcataacag 5160 
gggattccca atacgaggtc gccaacatct 5220 
agcagcagac gcgctacttc gagcggaggc 5280 
gggcgtatat gctccgcatt ggtcttgacc 5340 
tcgatgatgc agcttgggcg cagggtcgat 5400 
ctgtcgggcg tacacaaatc gcccgcagaa 54 60 
aagtactcgc cgatagtgga aaccgacgcc 5520 
ggagatgggg gaggctaact gaaacacgga 5580 
tgacggcaat aaaaagacag aataaaacgc 5640 
ggggttcggt cccagggctg gcactctgtc 5700 
tacgcccgcg tttcttcctt ttccccaccc 57 60 
ctcgcagcca acgtcggggc ggcaggccct 5820 
cggggtcccc catggggaat ggtttatggt 5880 
ggggtctggt ccacgactgg actgagcaga 594 0 
tggaccgcat gtactggcgc gacacgaaca 6000 
acccccaaaa accaccgcgc ggatttctgg 60 60 
gccccgacac ccgccaacac ccgctgacgc 6120 
cgcttacaga caagctgtga ccgtctccgg 6180 
atcaccgaaa cgcgcgaggc agccggatca 624 0 
tacttgcttt aaaaaacctc cccacctccc 6300 
tgttgttgtt aacttgttta ttgcagctta 6360 
aaatttcaca aataaagcat ttttttcact 6420 
caatgtatct tatcatgtct ggatccacta 64 80 
tccagctttt gttcccttta gtgagggtta 6540 
ctgtttcctg tgtgaaattg ttatccgctc 6600 
ataaagtgta aagcctgggg tgcctaatga 6660 
tcactgcccg ctttccagtc gggaaacctg 6720 
cgcgcgggga gaggcggttt gcgtattggg 678 0 
ctgcgctcgg tcgttcggct gcggcgagcg 6840 
ttatccacag aatcagggga taacgcagga 6900 
gccaggaacc gtaaaaaggc cgcgttgctg 6960 
gagcatcaca aaaatcgacg ctcaagtcag 7020 
taccaggcgt ttccccctgg aagctccctc 7080 
accggatacc tgtccgcctt tctcccttcg 7140 
tgtaggtatc tcagttcggt gtaggtcgtt 7200 
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cgctccaagc 
ggtaactatc 
actggtaaca 
tggcctaact 
gttaccttcg 
ggtggttttt 
cctttgatct 
ttggtcatga 
tttaaatcaa 
agtgaggcac 
gtcgtgtaga 
ccgcgagacc 
gccgagcgca 
cgggaagcta 
acaggcatcg 
cgatcaaggc 
cctccgatcg 
ctgcataatt 
tcaaccaagt 
atacgggata 
tcttcggggc 
actcgtgcac 
aaaacaggaa 
ctcatactct 
ggatacatat 
cgaaaagtgc 



tgggctgtgt 
gtcttgagtc 
ggattagcag 
acggctacac 
gaaaaagagt 
ttgtttgcaa 
tttctacggg 
gattatcaaa 
tctaaagtat 
ctatctcagc 
taactacgat 
cacgctcacc 
gaagtggtcc 
gagtaagtag 
tggtgtcacg 
gagttacatg 
ttgtcagaag 
ctcttactgt 
cattctgaga 
ataccgcgcc 
gaaaactctc 
ccaactgatc 
ggcaaaatgc 
tcctttttca 
ttgaatgtat 



gcacgaaccc 
caacccggta 
agcgaggtat 
tagaaggaca 
tggtagctct 
gcagcagatt 
gtctgacgct 
aaggatcttc 
atatgagtaa 
gatctgtcta 
acgggagggc 
ggctccagat 
tgcaacttta 
ttcgccagtt 
ctcgtcgttt 
atcccccatg 
taagttggcc 
catgccatcc 
atagtgtatg 
acatagcaga 
aaggatctta 
ttcagcatct 
cgcaaaaaag 
atattattga 
ttagaaaaat 



cccgttcagc 
agacacgact 
gtaggcggtg 
gtatttggta 
tgatccggca 
acgcgcagaa 
cagtggaacg 
acctagatcc 
acttggtctg 
tttcgttcat 
ttaccatctg 
ttatcagcaa 
tccgcctcca 
aatagtttgc 
ggtatggctt 
ttgtgcaaaa 
gcagtgttat 
gtaagatgct 
cggcgaccga 
actttaaaag 
ccgctgttga 
tttactttca 
ggaataaggg 
agcatttatc 
aaacaaatag 



ccgaccgctg 
tatcgccact 
ctacagagtt 
tctgcgctct 
aacaaaccac 
aaaaaggatc 
aaaactcacg 
ttttaaatta 
acagttacca 
ccatagttgc 
gccccagtgc 
taaaccagcc 
tccagtctat 
gcaacgttgt 
cattcagctc 
aagcggttag 
cactcatggt 
tttctgtgac 
gttgctcttg 
tgctcatcat 
gatccagttc 
ccagcgtttc 
cgacacggaa 
agggttattg 
gggttccgcg 



cgccttatcc 
ggcagcagcc 
cttgaagtgg 
gctgaagcca 
cgctggtagc 
tcaagaagat 
ttaagggatt 
aaaatgaagt 
atgcttaatc 
ctgactcccc 
tgcaatgata 
agccggaagg 
taattgttgc 
tgccattgct 
cggttcccaa 
ctccttcggt 
tatggcagca 
tggtgagtac 
cccggcgtca 
tggaaaacgt 
gatgtaaccc 
fcgggtgagca 
atgttgaata 
tctcatgagc 
cacatttccc 



7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8710 



<210> 5 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 5 

atgggatcca agatgaagcg cgcaagaccg 30 



<210> 6 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 6 

cataacgcgg ccgcttcttt attcttgggc 30 



<210> 7 
<211> 7148 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 7 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 
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ccgcatagtt aagccagtat ctgctccctg 
cgagcaaaat ttaagctaca acaaggcaag 
ttagggttag gcgttttgcg ctgcttcgcg 
gattattgac tagttattaa tagtaatcaa 
tggagttccg cgttacataa cttacggtaa 
cccgcccatt gacgtcaata atgacgtatg 
attgacgtca atgggtggac tatttacggt 
atcatatgcc aagtacgccc cctattgacg 
atgcccagta catgacctta tgggactttc 
tcgctattac catggtgatg cggttttggc 
actcacgggg atttccaagt ctccacccca 
aaaatcaacg ggactttcca aaatgtcgta 
gtaggcgtgt acggtgggag gtctatataa 
ctgcttactg gcttatcgaa attaatacga 
gagctcggat ccaagatgaa gcgcgcaaga 
ccatatgaca cggaaaccgg tcctccaact 
cccaatgggt ttcaagagag tccccctggg 
gttacctcca atggcatgct tgcgctcaaa 
ggcaacctta cctcccaaaa tgtaaccact 
aacataaacc tggaaatatc tgcacccctc 
gccgccgcac ctctaatggt cgcgggcaac 
accgtgcacg actccaaact tagcattgcc 
aagctagccc tgcaaacatc aggccccctc 
actgcctcac cccctctaac tactgccact 
atttatacac aaaatggaaa actaggacta 
gacctaaaca ctttgaccgt agcaactggt 
caaactaaag ttactggagc cttgggtttt 
gcaggaggac taaggattga ttctcaaaac 
tttgatgctc aaaaccaact aaatctaaga 
gcccacaact tggatattaa ctacaacaaa 
tccaaaaagc ttgaggttaa cctaagcact 
atagccatta atgcaggaga tgggcttgaa 
cccctcaaaa caaaaattgg ccatggccta 
aaactaggaa ctggccttag ttttgacagc 
aatgataagc taactttgtg gaccacacca 
gagaaagatg ctaaactcac tttggtctta 
gtttcagttt tggctgttaa aggcagtttg 
catcttatta taagatttga cgaaaatgga 
gaatattgga actttagaaa tggagatctt 
ggatttatgc ctaacctatc agcttatcca 
attgtcagtc aagtttactt aaacggagac 
ctaaacggta cacaggaaac aggagacaca 
tgggactggt ctggccacaa ctacattaat 
tcatacattg cccaagaata aagaagcggc 
ctatagtgtc acctaaatgc tagagctcgc 
cagccatctg ttgtttgccc ctcccccgtg 
actgtccttt cctaataaaa tgaggaaatt 
attctggggg gtggggtggg gcaggacagc 
catgctgggg atgcggtggg ctctatggct 
agggggtatc cccacgcgcc ctgtagcggc 
cgcagcgtga ccgctacact tgccagcgcc 
tcctttctcg ccacgttcgc cggctttccc 
gggttccgat ttagtgcttt acggcacctc 
tcacgtagtg ggccatcgcc ctgatagacg 
ttctttaata gtggactctt gttccaaact 
tcttttgatt tataagggat tttggggatt 
taacaaaaat ttaacgcgaa ttaattctgt 
ccccaggctc cccaggcagg cagaagtatg 
aggtgtggaa agtccccagg ctccccagca 
tagtcagcaa ccatagtccc gcccctaact 
tccgcccatt ctccgcccca tggctgacta 
gcctctgcct ctgagctatt ccagaagtag 



cttgtgtgtt ggaggtcgct gagtagtgcg 120 
gcttgaccga caattgcatg aagaatctgc 180 
atgtacgggc cagatatacg cgttgacatt 240 
ttacggggtc attagttcat agcccatata 300 
atggcccgcc tggctgaccg cccaacgacc 360 
ttcccatagt aacgccaata gggactttcc 420 
aaactgccca cttggcagta catcaagtgt 480 
tcaatgacgg taaatggccc gcctggcatt 540 
ctacttggca gtacatctac gtattagtca 600 
agtacatcaa tgggcgtgga tagcggtttg 660 
ttgacgtcaa tgggagtttg ttttggcacc 720 
acaactccgc cccattgacg caaatgggcg 780 
gcagagctct ctggctaact agagaaccca 84 0 
ctcactatag ggagacccaa gcttggtacc 900 
ccgtctgaag ataccttcaa ccccgtgtat 960 
gtgccttttc ttactcctcc ctttgtatcc 1020 
gtactctctt tgcgcctatc cgaacctcta 1080 
atgggcaacg gcctctctct ggacgaggcc 1140 
gtgagcccac ctctcaaaaa aaccaagtca 1200 
acagttacct cagaagccct aactgtggct 1260 
acactcacca tgcaatcaca ggccccgcta 1320 
acccaaggac ccctcacagt gtcagaagga 1380 
accaccaccg atagcagtac ccttactatc 1440 
ggtagcttgg gcattgactt gaaagagccc 1500 
aagtacgggg ctcctttgca tgtaacagac 1560 
ccaggtgtga ctattaataa tacttccttg 1620 
gattcacaag gcaatatgca acttaatgta 1680 
agacgcctta tacttgatgt tagttatccg 1740 
ctaggacagg gccctctttt tataaactca 1800 
ggcctttact tgtttacagc ttcaaacaat 18 60 
gccaaggggt tgatgtttga cgctacagcc 1920 
tttggttcac ctaatgcacc aaacacaaat 1980 
gaatttgatt caaacaaggc tatggttcct 2040 
acaggtgcca ttacagtagg aaacaaaaat 2100 
gctccatctc ctaactgtag actaaatgca 2160 
acaaaatgtg gcagtcaaat acttgctaca 2220 
gctccaatat ctggaacagt tcaaagtgct 2280 
gtgctactaa acaattcctt cctggaccca 2340 
actgaaggca cagcctatac aaacgctgtt 2400 
aaatctcacg gtaaaactgc caaaagtaac 2460 
aaaactaaac ctgtaacact aaccattaca 2520 
actccaagtg catactctat gtcattttca 2580 
gaaatatttg ccacatcctc ttacactttt 2640 
cgctcgagca tgcatctaga gggccctatt 2700 
tgatcagcct cgactgtgcc ttctagttgc 27 60 
ccttccttga ccctggaagg tgccactccc 2820 
gcatcgcatt gtctgagtag gtgtcattct 2880 
aagggggagg attgggaaga caatagcagg 2 94 0 
tctgaggcgg aaagaaccag ctggggctct 3000 
gcattaagcg cggcgggtgt ggtggttacg 3060 
ctagcgcccg ctcctttcgc tttcttccct 3120 
cgtcaagctc taaatcgggg catcccttta 3180 
gaccccaaaa aacttgatta gggtgatggt 3240 
gtttttcgcc ctttgacgtt ggagtccacg 3300 
ggaacaacac tcaaccctat ctcggtctat 3360 
tcggcctatt ggttaaaaaa tgagctgatt 3420 
ggaatgtgtg tcagttaggg tgtggaaagt 3480 
caaagcatgc atctcaatta gtcagcaacc 354 0 
ggcagaagta tgcaaagcat gcatctcaat 3600 
ccgcccatcc cgcccctaac tccgcccagt 3660 
atttttttta tttatgcaga ggccgaggcc 3720 
tgaggaggct tttttggagg cctaggcttt 3780 



1 



tgcaaaaagc tcccgggagc ttgtatatcc 
ggatcgtttc gcatgattga acaagatgga 
gagaggctat tcggctatga ctgggcacaa 
ttccggctgt cagcgcaggg gcgcccggtt 
ctgaatgaac tgcaggacga ggcagcgcgg 
tgcgcagctg tgctcgacgt tgtcactgaa 
gtgccggggc aggatctcct gtcatctcac 
gctgatgcaa tgcggcggct gcatacgctt 
gcgaaacatc gcatcgagcg agcacgtact 
gatctggacg aagagcatca ggggctcgcg 
cgcatgcccg acggcgagga tctcgtcgtg 
atggtggaaa atggccgctt ttctggattc 
cgctatcagg acatagcgtt ggctacccgt 
gctgaccgct tcctcgtgct ttacggtatc 
tatcgccttc ttgacgagtt cttctgagcg 
cgacgcccaa cctgccatca cgagatttcg 
gcttcggaat cgttttccgg gacgccggct 
tggagttctt cgcccacccc aacttgttta 
atagcatcac aaatttcaca aataaagcat 
ccaaactcat caatgtatct tatcatgtct 
cgtaatcatg gtcatagctg tttcctgtgt 
acatacgagc cggaagcata aagtgtaaag 
cattaattgc gttgcgctca ctgcccgctt 
attaatgaat cggccaacgc gcggggagag 
cctcgctcac tgactcgctg cgctcggtcg 
caaaggcggt aatacggtta tccacagaat 
caaaaggcca gcaaaaggcc aggaaccgta 
ggctccgccc ccctgacgag catcacaaaa 
cgacaggact ataaagatac caggcgtttc 
ttccgaccct gccgcttacc ggatacctgt 
tttctcaatg ctcacgctgt aggtatctca 
gctgtgtgca cgaacccccc gttcagcccg 
ttgagtccaa cccggtaaga cacgacttat 
ttagcagagc gaggtatgta ggcggtgcta 
gctacactag aaggacagta tttggtatct 
aaagagttgg tagctcttga tccggcaaac 
tttgcaagca gcagattacg cgcagaaaaa 
ctacggggtc tgacgctcag tggaacgaaa 
tatcaaaaag gatcttcacc tagatccttt 
aaagtatata tgagtaaact tggtctgaca 
tctcagcgat ctgtctattt cgttcatcca 
ctacgatacg ggagggctta ccatctggcc 
gctcaccggc tccagattta tcagcaataa 
gtggtcctgc aactttatcc gcctccatcc 
taagtagttc gccagttaat agtttgcgca 
tgtcacgctc gtcgtttggt atggcttcat 
ttacatgatc ccccatgttg tgcaaaaaag 
tcagaagtaa gttggccgca gtgttatcac 
ttactgtcat gccatccgta agatgctttt 
tctgagaata gtgtatgcgg cgaccgagtt 
ccgcgccaca tagcagaact ttaaaagtgc 
aactctcaag gatcttaccg ctgttgagat 
actgatcttc agcatctttt actttcacca 
aaaatgccgc aaaaaaggga ataagggcga 
tttttcaata ttattgaagc atttatcagg 
aatgtattta gaaaaataaa caaatagggg 
ctgacgtc 
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attttcggat ctgatcaaga gacaggatga 38 4 0 
ttgcacgcag gttctccggc cgcttgggtg 3900 
cagacaatcg gctgctctga tgccgccgtg 3960 
ctttttgtca agaccgacct gtccggtgcc 4020 
ctatcgtggc tggccacgac gggcgttcct 4080 
gcgggaaggg actggctgct attgggcgaa 4140 
cttgctcctg ccgagaaagt atccatcatg 4200 
gatccggcta cctgcccatt cgaccaccaa 4260 
cggatggaag ccggtcttgt cgatcaggat 4320 
ccagccgaac tgttcgccag gctcaaggcg 4380 
acccatggcg atgcctgctt gccgaatatc 44 4 0 
atcgactgtg gccggctggg tgtggcggac 4500 
gatattgctg aagagcttgg cggcgaatgg 4560 
gccgctcccg attcgcagcg catcgccttc 4 620 
ggactctggg gttcgaaatg accgaccaag 4 680 
attccaccgc cgccttctat gaaaggttgg 4740 
ggatgatcct ccagcgcggg gatctcatgc 4800 
ttgcagctta taatggttac aaataaagca 4 8 60 
ttttttcact gcattctagt tgtggtttgt 4 920 
gtataccgtc gacctctagc tagagcttgg 4 980 
gaaattgtta tccgctcaca attccacaca 5040 
cctggggtgc ctaatgagtg agctaactca 5100 
tccagtcggg aaacctgtcg tgccagctgc 5160 
gcggtttgcg tattgggcgc tcttccgctt 5220 
ttcggctgcg gcgagcggta tcagctcact 5280 
caggggataa cgcaggaaag aacatgtgag 534 0 
aaaaggccgc gttgctggcg tttttccata 5400 
atcgacgctc aagtcagagg tggcgaaacc 54 60 
cccctggaag ctccctcgtg cgctctcctg 5520 
ccgcctttct cccttcggga agcgtggcgc 5580 
gttcggtgta ggtcgttcgc tccaagctgg 5640 
accgctgcgc cttatccggt aactatcgtc 5700 
cgccactggc agcagccact ggtaacagga 57 60 
cagagttctt gaagtggtgg cctaactacg 5820 
gcgctctgct gaagccagtt accttcggaa 5880 
aaaccaccgc tggtagcggt ggtttttttg 5940 
aaggatctca agaagatcct ttgatctttt 6000 
actcacgtta agggattttg gtcatgagat 6060 
taaattaaaa atgaagtttt aaatcaatct 6120 
gttaccaatg cttaatcagt gaggcaccta 6180 
tagttgcctg actccccgtc gtgtagataa 6240 
ccagtgctgc aatgataccg cgagacccac 6300 
accagccagc cggaagggcc gagcgcagaa 6360 
agtctattaa ttgttgccgg gaagctagag 6420 
acgttgttgc cattgctaca ggcatcgtgg 6480 
tcagctccgg ttcccaacga tcaaggcgag 6540 
cggttagctc cttcggtcct ccgatcgttg 6600 
tcatggttat ggcagcactg cataattctc 6660 
ctgtgactgg tgagtactca accaagtcat 6720 
gctcttgccc ggcgtcaata cgggataata 6780 
tcatcattgg aaaacgttct tcggggcgaa 6840 
ccagttcgat gtaacccact cgtgcaccca 6900 
gcgtttctgg gtgagcaaaa acaggaaggc 6960 
cacggaaatg ttgaatactc atactcttcc 7020 
gttattgtct catgagcgga tacatatttg 7080 
ttccgcgcac atttccccga aaagtgccac 714 0 

7148 



<210> 8 
<211> 7469 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 8 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 18 0 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 24 0 
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 48 0 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 54 0 
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gcttggtacc 900 
gagctcggat ctgaattcga gctcgctgtt gggctcgcgg ttgaggacaa actcttcgcg 960 
gtctttccag tactcttgga tcggaaaccc gtcggcctcc gaacggtact ccgccaccga 1020 
gggacctgag cgagtccgca tcgaccggat cggaaaacct ctcgagaaag gcgtctaacc 1080 
agtcacagtc gcaaggtagg ctgagcaccg tggcgggcgg cagcgggtgg cggtcggggt 1140 
tgtttctggc ggaggtgctg ctgatgatgt aattaaagta ggcggtcttg agacggcgga 1200 
tggtcgaggt gaggtgtggc aggcttgaga tccaagatga agcgcgcaag accgtctgaa 12 60 
gataccttca accccgtgta tccatatgac acggaaaccg gtcctccaac tgtgcctttt 1320 
cttactcctc cctttgtatc ccccaatggg tttcaagaga gtccccctgg ggtactctct 1380 
ttgcgcctat ccgaacctct agttacctcc aatggcatgc ttgcgctcaa aatgggcaac 14 4 0 
ggcctctctc tggacgaggc cggcaacctt acctcccaaa atgtaaccac tgtgagccca 1500 
cctctcaaaa aaaccaagtc aaacataaac ctggaaatat ctgcacccct cacagttacc 1560 
tcagaagccc taactgtggc tgccgccgca cctctaatgg tcgcgggcaa cacactcacc 1620 
atgcaatcac aggccccgct aaccgtgcac gactccaaac ttagcattgc cacccaagga 168 0 
cccctcacag tgtcagaagg aaagctagcc ctgcaaacat caggccccct caccaccacc 17 4 0 
gatagcagta cccttactat cactgcctca ccccctctaa ctactgccac tggtagcttg 1800 
ggcattgact tgaaagagcc catttataca caaaatggaa aactaggact aaagtacggg 18 60 
gctcctttgc atgtaacaga cgacctaaac actttgaccg tagcaactgg tccaggtgtg 1920 
actattaata atacttcctt gcaaactaaa gttactggag ccttgggttt tgattcacaa 1980 
ggcaatatgc aacttaatgt agcaggagga ctaaggattg attctcaaaa cagacgcctt 204 0 
atacttgatg ttagttatcc gtttgatgct caaaaccaac taaatctaag actaggacag 2100 
ggccctcttt ttataaactc agcccacaac ttggatatta actacaacaa aggcctttac 2160 
ttgtttacag cttcaaacaa ttccaaaaag cttgaggtta acctaagcac tgccaagggg 2220 
ttgatgtttg acgctacagc catagccatt aatgcaggag atgggcttga atttggttca 228 0 
cctaatgcac caaacacaaa tcccctcaaa acaaaaattg gccatggcct agaatttgat 2340 
tcaaacaagg ctatggttcc taaactagga actggcctta gttttgacag cacaggtgcc 2400 
attacagtag gaaacaaaaa taatgataag ctaactttgt ggaccacacc agctccatct 24 60 
cctaactgta gactaaatgc agagaaagat gctaaactca ctttggtctt aacaaaatgt 2520 
ggcagtcaaa tacttgctac agtttcagtt ttggctgtta aaggcagttt ggctccaata 258 0 
tctggaacag ttcaaagtgc tcatcttatt ataagatttg acgaaaatgg agtgctacta 2640 
aacaattcct tcctggaccc agaatattgg aactttagaa atggagatct tactgaaggc 2700 
acagcctata caaacgctgt tggatttatg cctaacctat cagcttatcc aaaatctcac 27 60 
ggtaaaactg ccaaaagtaa cattgtcagt caagtttact taaacggaga caaaactaaa 2820 
cctgtaacac taaccattac actaaacggt acacaggaaa caggagacac aactccaagt 288 0 
gcatactcta tgtcattttc atgggactgg tctggccaca actacattaa tgaaatattt 294 0 
gccacatcct cttacacttt ttcatacatt gcccaagaat aaagaagcgg ccgctcgagc 3000 
atgcatctag agggccctat tctatagtgt cacctaaatg ctagagctcg ctgatcagcc 3060 
tcgactgtgc cttctagttg ccagccatct gttgtttgcc cctcccccgt gccttccttg 3120 
accctggaag gtgccactcc cactgtcctt tcctaataaa atgaggaaat tgcatcgcat 318 0 
tgtctgagta ggtgtcattc tattctgggg ggtggggtgg ggcaggacag caagggggag 324 0 
gattgggaag acaatagcag gcatgctggg gatgcggtgg gctctatggc ttctgaggcg 3300 
gaaagaacca gctggggctc tagggggtat ccccacgcgc cctgtagcgg cgcattaagc 3360 
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gcggcgggtg tggtggttac gcgcagcgtg 
gctcctttcg ctttcttccc ttcctttctc 
ctaaatcggg gcatcccttt agggttccga 
aaacttgatt agggtgatgg ttcacgtagt 
cctttgacgt tggagtccac gttctttaat 
ctcaacccta tctcggtcta ttcttttgat 
tggttaaaaa atgagctgat ttaacaaaaa 
gtcagttagg gtgtggaaag tccccaggct 
catctcaatt agtcagcaac caggtgtgga 
atgcaaagca tgcatctcaa ttagtcagca 
ccgcccctaa ctccgcccag ttccgcccat 
atttatgcag aggccgaggc cgcctctgcc 
ttttttggag gcctaggctt ttgcaaaaag 
tctgatcaag agacaggatg aggatcgttt 
ggttctccgg ccgcttgggt ggagaggcta 
ggctgctctg atgccgccgt gttccggctg 
aagaccgacc tgtccggtgc cctgaatgaa 
ctggccacga cgggcgttcc ttgcgcagct 
gactggctgc tattgggcga agtgccgggg 
gccgagaaag tatccatcat ggctgatgca 
acctgcccat tcgaccacca agcgaaacat 
gccggtcttg tcgatcagga tgatctggac 
ctgttcgcca ggctcaaggc gcgcatgccc 
gatgcctgct tgccgaatat catggtggaa 
ggccggctgg gtgtggcgga ccgctatcag 
gaagagcttg gcggcgaatg ggctgaccgc 
gattcgcagc gcatcgcctt ctatcgcctt 
ggttcgaaat gaccgaccaa gcgacgccca 
ccgccttcta tgaaaggttg ggcttcggaa 
tccagcgcgg ggatctcatg ctggagttct 
ataatggtta caaataaagc aatagcatca 
tgcattctag ttgtggtttg tccaaactca 
cgacctctag ctagagcttg gcgtaatcat 
atccgctcac aattccacac aacatacgag 
cctaatgagt gagctaactc acattaattg 
gaaacctgtc gtgccagctg cattaatgaa 
gtattgggcg ctcttccgct tcctcgctca 
ggcgagcggt atcagctcac tcaaaggcgg 
acgcaggaaa gaacatgtga gcaaaaggcc 
cgttgctggc gtttttccat aggctccgcc 
caagtcagag gtggcgaaac ccgacaggac 
gctccctcgt gcgctctcct gttccgaccc 
tcccttcggg aagcgtggcg ctttctcaat 
aggtcgttcg ctccaagctg ggctgtgtgc 
ccttatccgg taactatcgt cttgagtcca 
cagcagccac tggtaacagg attagcagag 
tgaagtggtg gcctaactac ggctacacta 
tgaagccagt taccttcgga aaaagagttg 
ctggtagcgg tggttttttt gtttgcaagc 
aagaagatcc tttgatcttt tctacggggt 
aagggatttt ggtcatgaga ttatcaaaaa 
aatgaagttt taaatcaatc taaagtatat 
gcttaatcag tgaggcacct atctcagcga 
gactccccgt cgtgtagata actacgatac 
caatgatacc gcgagaccca cgctcaccgg 
ccggaagggc cgagcgcaga agtggtcctg 
attgttgccg ggaagctaga gtaagtagtt 
ccattgctac aggcatcgtg gtgtcacgct 
gttcccaacg atcaaggcga gttacatgat 
ccttcggtcc tccgatcgtt gtcagaagta 
tggcagcact gcataattct cttactgtca 
gtgagtactc aaccaagtca ttctgagaat 



accgctacac ttgccagcgc cctagcgccc 3420 
gccacgttcg ccggctttcc ccgtcaagct 3480 
tttagtgctt tacggcacct cgaccccaaa 3540 
gggccatcgc cctgatagac ggtttttcgc 3600 
agtggactct tgttccaaac tggaacaaca 3660 
ttataaggga ttttggggat ttcggcctat 3720 
tttaacgcga attaattctg tggaatgtgt 3780 
ccccaggcag gcagaagtat gcaaagcatg 3840 
aagtccccag gctccccagc aggcagaagt 3900 
accatagtcc cgcccctaac tccgcccatc 3960 
tctccgcccc atggctgact aatttttttt 4020 
tctgagctat tccagaagta gtgaggaggc 4080 
ctcccgggag cttgtatatc cattttcgga 4140 
cgcatgattg aacaagatgg attgcacgca 4200 
ttcggctatg actgggcaca acagacaatc 42 60 
tcagcgcagg ggcgcccggt tctttttgtc 4320 
ctgcaggacg aggcagcgcg gctatcgtgg 4380 
gtgctcgacg ttgtcactga agcgggaagg 44 4 0 
caggatctcc tgtcatctca ccttgctcct 4500 
atgcggcggc tgcatacgct tgatccggct 4560 
cgcatcgagc gagcacgtac tcggatggaa 4 620 
gaagagcatc aggggctcgc ^ccagccgaa 4680 
gacggcgagg atctcgtcgt gacccatggc 4740 
aatggccgct tttctggatt catcgactgt 4800 
gacatagcgt tggctacccg tgatattgct 48 60 
ttcctcgtgc tttacggtat cgccgctccc 4920 
cttgacgagt tcttctgagc gggactctgg 4980 
acctgccatc acgagatttc gattccaccg 5040 
tcgttttccg ggacgccggc tggatgatcc 5100 
tcgcccaccc caacttgttt attgcagctt 5160 
caaatttcac aaataaagca tttttttcac 5220 
tcaatgtatc ttatcatgtc tgtataccgt 5280 
ggtcatagct gtttcctgtg tgaaattgtt 5340 
ccggaagcat aaagtgtaaa gcctggggtg 5400 
cgttgcgctc actgcccgct ttccagtcgg 5460 
tcggccaacg cgcggggaga ggcggtttgc 5520 
ctgactcgct gcgctcggtc gttcggctgc 5580 
taatacggtt atccacagaa tcaggggata 5640 
agcaaaaggc caggaaccgt aaaaaggccg 5700 
cccctgacga gcatcacaaa aatcgacgct 57 60 
tataaagata ccaggcgttt ccccctggaa 5820 
tgccgcttac cggatacctg tccgcctttc 5880 
gctcacgctg taggtatctc agttcggtgt 5940 
acgaaccccc cgttcagccc gaccgctgcg 6000 
acccggtaag acacgactta tcgccactgg 6060 
cgaggtatgt aggcggtgct acagagttct 6120 
gaaggacagt atttggtatc tgcgctctgc 6180 
gtagctcttg atccggcaaa caaaccaccg 6240 
agcagattac gcgcagaaaa aaaggatctc 6300 
ctgacgctca gtggaacgaa aactcacgtt 6360 
ggatcttcac ctagatcctt ttaaattaaa 6420 
atgagtaaac ttggtctgac agttaccaat 6480 
tctgtctatt tcgttcatcc atagttgcct 6540 
gggagggctt accatctggc cccagtgctg 6600 
ctccagattt atcagcaata aaccagccag 6660 
caactttatc cgcctccatc cagtctatta 6720 
cgccagttaa tagtttgcgc aacgttgttg 6780 
cgtcgtttgg tatggcttca ttcagctccg 684 0 
cccccatgtt gtgcaaaaaa gcggttagct 6900 
agttggccgc agtgttatca ctcatggtta 6960 
tgccatccgt aagatgcttt tctgtgactg 7020 
agtgtatgcg gcgaccgagt tgctcttgcc 7080 
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cggcgtcaat acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg 7140 

gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga 7200 

tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg 7260 

ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat 7320 

gttgaatact catactcttc ctttttcaat attattgaag catttatcag ggttattgtc 7380 

tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca 7440 

catttccccg aaaagtgcca cctgacgtc 7469 

<210> 9 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 9 

tgcttaagcg gccgcgaagg agaagtcc 

<210> 10 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 10 

ccgagctagc gactgaaaat gag 

<210> 11 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 11 

cctctcgaga gacagcaaga cac 



<210> 12 
<211> 11152 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 

<400> 12 ar . 
aagcttgggc agaaatggtt gaactcccga gagtgtccta cacctagggg agaagcagcc 60 
aaggggttgt ttcccaccaa ggacgacccg tctgcgcaca aacggatgag cccatcagac 120 
aaagacatat tcattctctg ctgcaaactt ggcatagctc tgctttgcct ggggctattg 180 
ggggaagttg cggttcgtgc tcgcagggct ctcacccttg actcttttaa tagctcttct 240 
gtgcaagatt acaatctaaa caattcggag aactcgacct tcctcctgag gcaaggacca 300 
cagccaactt cctcttacaa gccgcatcga ttttgtcctt cagaaataga aataagaatg 360 
cttgctaaaa attatatttt taccaataag accaatccaa taggtagatt attagttact 420 
atgttaagaa atgaatcatt atcttttagt actattttta ctcaaattca gaagttagaa 480 



atgggaatag aaaatagaaa gagacgctca 
ttgaccacag gcctagaagt aaaaaaggga 
aggtggtggc aaccagggac ttatagggga 
ttaccatata caggaagata tgacttaaat 
aaagtgttat atagatccct cccttttcgt 
tgtatgttgt ctcaagaaga aaaagacgac 
ctaggaacag gaatgcactt ttggggaaag 
ggactaatag aacattattc tgcaaaaact 
ttggcccaac cttgcggttc ccagggctta 
aacgaggatg tgagacaagt ggtttcctga 
gctctgagtg ttctattttc ctatgttctt 
ttatgtaaac caagatataa aagagtgctg 
cattcacctc ttgtgtgttt gtgtctgttc 
tcactttcca gagggtcccc ccgcagaccc 
gctggcgccc gaacagggac cctcggataa 
tgtttgtctt gtattgtctc tttcttgtct 
ccatagggac caagctagcg actgaaaatg 
accgaagaaa tggccgccag tcttttggac 
cttccacctc ctagccattt tgaaccacct 
acggcccccg aagatcccaa cgaggaggcg 
ttggcggtgc aggaagggat tgacttactc 
ccgcctcacc tttcccggca gcccgagcag 
atgccaaacc ttgtaccgga ggtgatcgat 
agtgacgacg aggatgaaga gggtgaggag 
cacggttgca ggtcttgtca ttatcaccgg 
tcgctttgct atatgaggac ctgtggcatg 
gtgggtgata gagtggtggg tttggtgtgg 
ggtttaaaga attttgtatt gtgatttttt 
agcccgagcc agaaccggag cctgcaagac 
tcctgagacg cccgacatca cctgtgtcta 
actccggtcc ttctaacaca cctcctgaga 
aaccagttgc cgtgagagtt ggtgggcgtc 
ttaacgagcc tgggcaacct ttggacttga 
acctgtgatt gcgtgtgtgg ttaacgcctt 
taaagggtga gataatgttt aacttgcatg 
atataatgcg ccgtgggcta atcttggtta 
tggaagattt ttctgctgtg cgtaacttgc 
tttggaggtt tctgtggggc tcatcccagg 
acaagtggga atttgaagag cttttgaaat 
tgggtcacca ggcgcttttc caagagaagg 
ggcgcgctgc ggctgctgtt gcttttttga 
cccatctgag cggggggtac ctgctggatt 
tgagacacaa gaatcgcctg ctactgttgt 
aggagcagca gcagcagcag gaggaagcca 
acccgagagc cggcctggac cctcgggaat 
agaactgaga cgcattttga caattacaga 
ggagcggggg gcttgtgagg ctacagagga 
cagacaccgt cctgagtgta ttacttttca 
tgatctgctg gcgcagaagt attccataga 
ggatgatttt gaggaggcta ttagggtata 
gtacaagatc agcaaacttg taaatatcag 
cgaggtggag atagatacgg aggatagggt 
gccgggggtg cttggcatgg acggggtggt 
ttttagcggt acggttttcc tggccaatac 
tgggtttaac aatacctgtg tggaagcctg 
ttactgctgc tggaaggggg tggtgtgtcg 
cctctttgaa aggtgtacct tgggtatcct 
tgtggcctcc gactgtggtt gcttcatgct 
catggtatgt ggcaactgcg aggacagggc 
ctgtcacctg ctgaagacca ttcacgtagc 
tgagcataac atactgaccc gctgttcctt 
accttaccaa tgcaatttga gtcacactaa 
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acctcaattg aagaacaggt gcaaggacta 540 
aaaaagagtg tttttgtcaa aataggagac 600 
ccttacatct acagaccaac agatgccccc 660 
tgggataggt gggttacagt caatggctat 720 
gaaagactcg ccagagctag acctccttgg 780 
atgaaacaac aggtacatga ttatatttat 840 
attttccata ccaaggaggg gacagtggct 900 
catggcatga gttattatga atagccttta 960 
agtaagtttt tggttacaaa ctgttcttaa 1020 
cttggtttgg tatcaaaggt tctgatctga 1080 
ttggaattta tccaaatctt atgtaaatgc 1140 
attttttgag taaacttgca acagtcctaa 1200 
gccatcccgt ctccgctcgt cacttatcct 12 60 
cggcgaccct caggtcggcc gactgcggca 1320 
gtgacccttg tctctatttc tactatttgg 1380 
ggctatcatc acaagagcgg aacggactca 1440 
agacatatta tctgccacgg aggtgttatt 1500 
cagctgatcg aagaggtact ggctgataat 1560 
acccttcacg aactgtatga tttagacgtg 1620 
gtttcgcaga tttttcccga ctctgtaatg 1680 
acttttccgc cggcgcccgg ttctccggag 17 4 0 
ccggagcaga gagccttggg tccggtttct 18 00 
cttacctgcc acgaggctgg ctttccaccc 18 60 
tttgtgttag attatgtgga gcaccccggg 1920 
aggaatacgg gggacccaga tattatgtgt 1980 
tttgtctaca gtaagtgaaa attatgggca 2040 
taattttttt tttaattttt acagttttgt 2100 
taaaaggtcc tgtgtctgaa cctgagcctg 2160 
ctacccgccg tcctaaaatg gcgcctgcta 2220 
gagaatgcaa tagtagtacg gatagctgtg 2280 
tacacccggt ggtcccgctg tgccccatta 2340 
gccaggctgt ggaatgtatc gaggacttgc 2400 
gctgtaaacg ccccaggcca taaggtgtaa 2460 
tgtttgctga atgagttgat gtaagtttaa 2520 
gcgtgttaaa tggggcgggg cttaaagggt 2580 
catctgacct catggaggct tgggagtgtt 264 0 
tggaacagag ctctaacagt acctcttggt 2700 
caaagttagt ctgcagaatt aaggaggatt 27 60 
cctgtggtga gctgtttgat tctttgaatc 2820 
tcatcaagac tttggatttt tccacaccgg 2880 
gttttataaa ggataaatgg agcgaagaaa 2 94 0 
ttctggccat gcatctgtgg agagcggttg 3000 
cttccgtccg cccggcgata ataccgacgg 3060 
ggcggcggcg gcaggagcag agcccatgga 3120 
gaatgttgta caggtggctg aactgtatcc 3180 
ggatgggcag gggctaaagg gggtaaagag 32 4 0 
ggctaggaat ctagctttta gcttaatgac 3300 
acagatcaag gataattgcg ctaatgagct 3360 
gcagctgacc acttactggc tgcagccagg 3420 
tgcaaaggtg gcacttaggc cagattgcaa 3480 
gaattgttgc tacatttctg ggaacggggc 3540 
ggcctttaga tgtagcatga taaatatgtg 3600 
tattatgaat gtaaggttta ctggccccaa 3660 
caaccttatc ctacacggtg taagcttcta 3720 
gaccgatgta agggttcggg gctgtgcctt 37 80 
ccccaaaagc agggcttcaa ttaagaaatg 3840 
gtctgagggt aactccaggg tgcgccacaa 3900 
agtgaaaagc gtggctgtga ttaagcataa 3960 
ctctcagatg ctgacctgct cggacggcaa 4020 
cagccactct cgcaaggcct ggccagtgtt 4080 
gcatttgggt aacaggaggg gggtgttcct 414 0 
gatattgctt gagcccgaga gcatgtccaa 4200 
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ggtgaacctg aacggggtgt ttgacatgac 
tgagacccgc accaggtgca gaccctgcga 
tgtgatgctg gatgtgaccg aggagctgag 
cgctgagttt ggctctagcg atgaagatac 
cttaagggtg ggaaagaata tataaggtgg 
agcagccgcc gccgccatga gcaccaactc 
gacaacgcgc atgcccccat gggccggggt 
tggtcgcccc gtcctgcccg caaactctac 
gccgttggag actgcagcct ccgccgccgc 
tgtgactgac tttgctttcc tgagcccgct 
ccgcgatgac aagttgacgg ctcttttggc 
tgtcgtttct cagcagctgt tggatctgcg 
ccctcccaat gcggtttaaa acataaataa 
gcaagtgtct tgctgtctct cgagggatct 
cataattgga caaactacct acagagattt 
gtgtataatg tgttaaacta ctgattctaa 
gaactgatga atgggagcag tggtggaatg 
aagaaatgcc atctagtgat gatgaggcta 
aaaagaagag aaaggtagaa gaccccaagg 
gtcatgctgt gtttagtaat agaactcttg 
aagctgcact gctatacaag aaaattatgg 
ataacagtta taatcataac atactgtttt 
ctattaataa ctatgctcaa aaattgtgta 
ataaggaata tttgatgtat agtgccttga 
gtagaggttt tacttgcttt aaaaaacctc 
atgaatgcaa ttgttgttgt taacttgttt 
aatagcatca caaatttcac aaataaagca 
tccaaactca tcaatgtatc ttatcatgtc 
gggtgtggaa agtccccagg ctccccagca 
tagtcagcaa ccaggtgtgg aaagtcccca 
atgcatctca attagtcagc aaccatagtc 
actccgccca gttccgccca ttctccgccc 
gaggccgagg ccgcctcggc ctctgagcta 
ggcctaggct tttgcaaaaa gcttggacac 
accactttat cccgcgtcag ggagaggcag 
actggttttt agtgcgccag atctctataa 
tttttaagcc gtagataaac aggctgggac 
ctgggacatg ttgcagatcc atgcacgtaa 
atggaaaggc attattgccg taagccgtgg 
tgaactgggt attcgtcatg tcgataccgt 
gcgcgagctt aaagtgctga aacgcgcaga 
tgacctggtg gataccggtg gtactgcggt 
ctttgtcacc atcttcgcaa aaccggctgg 
tatcccgcaa gatacctgga ttgaacagcc 
aatctccggt cgctaatctt ttcaacgcct 
caggcgggtt acaatagttt ccagtaagta 
acctgagcga aaccctgttc aaaccccgct 
gccgctttaa tcacggcgca caaccgcctg 
ctcactggta tcgcatgatt aaccgtctga 
aaatcctcga cgtccaggca cgtattgtga 
tatacgatac ggtgattggc taccgtggcg 
tttgtgaagg aaccttactt ctgtggtgtg 
taaagctcta aggtaaatat aaaattttta 
attgtttgtg tattttagat tccaacctat 
gcctttaatg aggaaaacct gttttgctca 
actgctgact ctcaacattc tactcctcca 
gactttcctt cagaattgct aagttttttg 
gcttgctttg ctatttacac cacaaaggaa 
gaaaaatatt ctgtaacctt tataagtagg 
tttcttactc cacacaggca tagagtgtct 
acctttagct ttttaatttg taaaggggtt 
actagagatc ataatcagcc ataccacatt 



catgaagatc tggaaggtgc tgaggtacga 4260 
gtgtggcggt aaacatatta ggaaccagcc 4320 
gcccgatcac ttggtgctgg cctgcacccg 4380 
agattgaggt actgaaatgt gtgggcgtgg 44 40 
gggtcttatg tagttttgta tctgttttgc 4500 
gtttgatgga agcattgtga gctcatattt 4 5 60 
gcgtcagaat gtgatgggct ccagcattga 4 620 
taccttgacc tacgagaccg tgtctggaac 4 680 
ttcagccgct gcagccaccg cccgcgggat 4740 
tgcaagcagt gcagcttccc gttcatccgc 4800 
acaattggat tctttgaccc gggaacttaa 4860 
ccagcaggtt tctgccctga aggcttcctc 4 920 
aaaaccagac tctgtttgga tttggatcaa 4 980 
ttgtgaagga accttacttc tgtggtgtga 5040 
aaagctctaa ggtaaatata aaatttttaa 5100 
ttgtttgtgt attttagatt ccaacctatg 5160 
cctttaatga ggaaaacctg ttttgctcag 5220 
ctgctgactc tcaacattct actcctccaa 5280 
actttccttc agaattgcta agttttttga 5340 
cttgctttgc tatttacacc acaaaggaaa 5400 
aaaaatattc tgtaaccttt ataagtaggc 5460 
ttcttactcc acacaggcat agagtgtctg 5520 
cctttagctt tttaatttgt aaaggggtta 5580 
ctagagatca taatcagcca taccacattt 5640 
ccacacctcc ccctgaacct gaaacataaa 5700 
attgcagctt ataatggtta caaataaagc 57 60 
tttttttcac tgcattctag ttgtggtttg 5820 
tggatccggc tgtggaatgt gtgtcagtta 5880 
ggcagaagta tgcaaagcat gcatctcaat 5940 
ggctccccag caggcagaag tatgcaaagc 6000 
ccgcccctaa ctccgcccat cccgccccta 6060 
catggctgac taattttttt tatttatgca 6120 
ttccagaagt agtgaggagg cttttttgga 6180 
aagacaggct tgcgagatat gtttgagaat 6240 
tgcgtaaaaa gacgcggact catgtgaaat 6300 
tctcgcgcaa cctattttcc cctcgaacac 6360 
acttcacatg agcgaaaaat acatcgtcac 6420 
actcgcaagc cgactgatgc cttctgaaca 6480 
cggtctggta ccgggtgcgt tactggcgcg 6540 
ttgtatttcc agctacgatc acgacaacca 6600 
aggcgatggc gaaggcttca tcgttattga 6660 
tgcgattcgt gaaatgtatc caaaagcgca 6720 
tcgtccgctg gttgatgact atgttgttga 6780 
gtgggatatg ggcgtcgtat tcgtcccgcc 68 4 0 
ggcactgccg ggcgttgttc tttttaactt 6900 
ttctggaggc tgcatccatg acacaggcaa 6960 
ttaaacatcc tgaaacctcg acgctagtcc 7020 
tgcagtcggc ccttgatggt aaaaccatcc 7080 
tgtggatctg gcgcggcatt gacccacgcg 7140 
tgagcgatgc cgaacgtacc gacgatgatt 7200 
gcaactggat ttatgagtgg gccccggatc 72 60 
acataattgg acaaactacc tacagagatt 7320 
agtgtataat gtgttaaact actgattcta 7380 
ggaactgatg aatgggagca gtggtggaat 74 4 0 
gaagaaatgc catctagtga tgatgaggct 7500 
aaaaagaaga gaaaggtaga agaccccaag 75 60 
agtcatgctg tgtttagtaa tagaactctt 7620 
aaagctgcac tgctatacaa gaaaattatg 7680 
cataacagtt ataatcataa catactgttt 7740 
gctattaata actatgctca aaaattgtgt 7800 
aataaggaat atttgatgta tagtgccttg 78 60 
tgtagaggtt ttacttgctt taaaaaacct 7920 



1 
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cccacacctc 
tattgcagct 
atttttttca 
ctggatcccc 
acattccaat 
acaaaaagga 
atctgggaag 
caacagcaga 
gaagcactgt 
tgtaggttcc 
ctggataagc 
caactgtagc 
acacaccctg 
gaatgggttt 
tagcagttaC 
cacaggttaa 
gatacgccta 
cacttttcgg 
tatgtatccg 
gagtatgagt 
tcctgttttt 
tgcacgagtg 
ccccgaagaa 
atcccgtgtt 
cttggttgag 
attatgcagt 
gatcggagga 
ccttgatcgt 
gatgcctgca 
agcttcccgg 
gcgctcggcc 
gtctcgcggt 
ctacacgacg 
tgcctcactg 
tgatttaaaa 
catgaccaaa 
gatcaaagga 
aaaaccaccg 
gaaggtaact 
gttaggccac 
gttaccagtg 
atagttaccg 
cttggagcga 
cacgcttccc 
agagcgcacg 
tcgccacctc 
gaaaaacgcc 
catgttcttt 
agctgatacc 
ggaagagcgc 
atggtgcact 
ctatcgctac 
ccctgacggg 
agctgcatgt 



cccctgaacc 
tataatggtt 
ctgcattcta 
aggaagctcc 
cataggctgc 
aattgggtag 
tcccttccac 
aacatacaag 
ggttgctgtg 
aaaatatcta 
attatcctta 
attttttggg 
cagctccaaa 
tccagcacca 
cccaataacc 
gtcctcattt 
tttttatagg 
ggaaatgtgc 
ctcatgagac 
attcaacatt 
gctcacccag 
ggttacatcg 
cgttttccaa 
gacgccgggc 
tactcaccag 
gctgccataa 
ccgaaggagc 
tgggaaccgg 
gcaatggcaa 
caacaattaa 
cttccggctg 
atcattgcag 
gggagtcagg 
attaagcatt 
cttcattttt 
atcccttaac 
tcttcttgag 
ctaccagcgg 
ggcttcagca 
cacttcaaga 
gctgctgcca 
gataaggcgc 
acgacctaca 
gaagggagaa 
agggagcttc 
tgacttgagc 
agcaacgcgg 
cctgcgttat 
gctcgccgca 
ctgatgcggt 
ctcagtacaa 
gtgactgggt 
cttgtctgct 
gtcagaggtt 



tgaaacataa 
acaaataaag 
gttgtggttt 
tctgtgtcct 
ccatccaccc 
gggtttttca 
tgctgtgttc 
ctgtcagctt 
ttagtaatgt 
gtgttttcat 
tccaaaacag 
gttacagttt 
ggttccccac 
ttttcatgag 
tcagttttaa 
aaattaggca 
ttaatgtcat 
gcggaacccc 
aataaccctg 
tccgtgtcgc 
aaacgctggt 
aactggatct 
tgatgagcac 
aagagcaact 
tcacagaaaa 
ccatgagtga 
taaccgcttt 
agctgaatga 
caacgttgcg 
tagactggat 
gctggtttat 
cactggggcc 
caactatgga 
ggtaactgtc 
aatttaaaag 
gtgagttttc 
atcctttttt 
tggtttgttt 
gagcgcagat 
actctgtagc 
gtggcgataa 
agcggtcggg 
ccgaactgag 
aggcggacag 
cagggggaaa 
gtcgattttt 
cctttttacg 
cccctgattc 
gccgaacgac 
attttctcct 
tctgctctga 
catggctgcg 
cccggcatcc 
ttcaccgtca 



aatgaatgca 
caatagcatc 
gtccaaactc 
cataaaccct 
tctgtgtcct 
cagaccgctt 
cagaagtgtt 
tgcacaaggg 
gcaaaacagg 
ttttacttgg 
ccttgtggtc 
gagcaggata 
caacagcaaa 
ttttttgtgt 
cagtaacagc 
aaggaattct 
gataataatg 
tatttgttta 
ataaatgctt 
ccttattccc 
gaaagtaaaa 
caacagcggt 
ttttaaagtt 
cggtcgccgc 
gcatcttacg 
taacactgcg 
tttgcacaac 
agccatacca 
caaactatta 
ggaggcggat 
tgctgataaa 
agatggtaag 
tgaacgaaat 
agaccaagtt 
gatctaggtg 
gttccactga 
tctgcgcgta 
gccggatcaa 
accaaatact 
accgcctaca 
gtcgtgtctt 
ctgaacgggg 
atacctacag 
gtatccggta 
cgcctggtat 
gtgatgctcg 
gttcctggcc 
tgtggataac 
cgagcgcagc 
tacgcatctg 
tgccgcatag 
ccccgacacc 
gcttacagac 
tcaccgaaac 



attgttgttg 
acaaatttca 
atcaatgtat 
aacctcctct 
cctgttaatt 
tctaagggta 
ggtaaacagc 
cccaacaccc 
aggcacattt 
atcaggaacc 
agtgttcatc 
tttggtcctg 
aaaatgaaaa 
ccctgaatgc 
ttcccacatc 
tgaagacgaa 
gtttcttaga 
tttttctaaa 
caataatatt 
ttttttgcgg 
gatgctgaag 
aagatccttg 
ctgctatgtg 
atacactatt 
gatggcatga 
gccaacttac 
atgggggatc 
aacgacgagc 
actggcgaac 
aaagttgcag 
tctggagccg 
ccctcccgta 
agacagatcg 
tactcatata 
aagatccttt 
gcgtcagacc 
atctgctgct 
gagctaccaa 
gtccttctag 
tacctcgctc 
accgggttgg 
ggttcgtgca 
cgtgagctat 
agcggcaggg 
ctttatagtc 
tcaggggggc 
ttttgctggc 
cgtattaccg 
gagtcagtga 
tgcggtattt 
ttaagccagt 
cgccaacacc 
aagctgtgac 
gcgcgaggca 



ttaacttgtt 
caaataaagc 
cttatcatgt 
acttgagagg 
aggtcactta 
attttaaaat 
ccacaaatgt 
tgctcatcaa 
tccccacctg 
cagcactcca 
tgctgactgt 
tagtttgcta 
tttgaccctt 
aagtttaaca 
aaaatatttc 
agggcctcgt 
cgtcaggtgg 
tacattcaaa 
gaaaaaggaa 
cattttgcct 
atcagttggg 
agagttttcg 
gcgcggtatt 
ctcagaatga 
cagtaagaga 
ttctgacaac 
atgtaactcg 
gtgacaccac 
tacttactct 
gaccacttct 
gtgagcgtgg 
tcgtagttat 
ctgagatagg 
tactttagat 
ttgataatct 
ccgtagaaaa 
tgcaaacaaa 
ctctttttcc 
tgtagccgta 
tgctaatcct 
actcaagacg 
cacagcccag 
gagaaagcgc 
tcggaacagg 
ctgtcgggtt 
ggagcctatg 
cttttgctca 
cctttgagtg 
gcgaggaagc 
cacaccgcat 
atacactccg 
cgctgacgcg 
cgtctccggg 
gc 



7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11152 



<210> 13 
<211> 19 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Description of Artificial Sequence: primer 
<400> 13 

gacggatcgg gagatctcc 



<210> 14 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 14 

ccgcctcaga agccatagag cc 



<210> 15 
<211> 14455 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: plasmid 



<400> 15 

aagcttgggc agaaatggtt gaactcccga 
aaggggttgt ttcccaccaa ggacgacccg 
aaagacatat tcattctctg ctgcaaactt 
ggggaagttg cggttcgtgc tcgcagggct 
gtgcaagatt acaatctaaa caattcggag 
cagccaactt cctcttacaa gccgcatcga 
cttgctaaaa attatatttt taccaataag 
atgttaagaa atgaatcatt atcttttagt 
atgggaatag aaaatagaaa gagacgctca 
ttgaccacag gcctagaagt aaaaaaggga 
aggtggtggc aaccagggac ttatagggga 
ttaccatata caggaagata tgacttaaat 
aaagtgttat atagatccct cccttttcgt 
tgtatgttgt ctcaagaaga aaaagacgac 
ctaggaacag gaatgcactt ttggggaaag 
ggactaatag aacattattc tgcaaaaact 
ttggcccaac cttgcggttc ccagggctta 
aacgaggatg tgagacaagt ggtttcctga 
gctctgagtg ttctattttc ctatgttctt 
ttatgtaaac caagatataa aagagtgctg 
cattcacctc ttgtgtgttt gtgtctgttc 
tcactttcca gagggtcccc ccgcagaccc 
gctggcgccc gaacagggac cctcggataa 
tgtttgtctt gtattgtctc tttcttgtct 
ccatagggac caagctagcg actgaaaatg 
accgaagaaa tggccgccag tcttttggac 
cttccacctc ctagccattt tgaaccacct 
acggcccccg aagatcccaa cgaggaggcg 
ttggcggtgc aggaagggat tgacttactc 
ccgcctcacc tttcccggca gcccgagcag 
atgccaaacc ttgtaccgga ggtgatcgat 
agtgacgacg aggatgaaga gggtgaggag 
cacggttgca ggtcttgtca ttatcaccgg 
tcgctttgct* atatgaggac ctgtggcatg 
gtgggtgata gagtggtggg tttggtgtgg 



gagtgtccta cacctagggg agaagcagcc 60 
tctgcgcaca aacggatgag cccatcagac 120 
ggcatagctc tgctttgcct ggggctattg 180 
ctcacccttg actcttttaa tagctcttct 240 
aactcgacct tcctcctgag gcaaggacca 300 
ttttgtcctt cagaaataga aataagaatg 360 
accaatccaa taggtagatt attagttact 420 
actattttta ctcaaattca gaagttagaa 480 
acctcaattg aagaacaggt gcaaggacta 540 
aaaaagagtg tttttgtcaa aataggagac 600 
ccttacatct acagaccaac agatgccccc 660 
tgggataggt gggttacagt caatggctat 720 
gaaagactcg ccagagctag acctccttgg 780 
atgaaacaac aggtacatga ttatatttat 840 
attttccata ccaaggaggg gacagtggct 900 
catggcatga gttattatga atagccttta 960 
agtaagtttt tggttacaaa ctgttcttaa 1020 
cttggtttgg tatcaaaggt tctgatctga 1080 
ttggaattta tccaaatctt atgtaaatgc 114 0 
attttttgag taaacttgca acagtcctaa 1200 
gccatcccgt ctccgctcgt cacttatcct 1260 
cggcgaccct caggtcggcc gactgcggca 1320 
gtgacccttg tctctatttc tactatttgg 1380 
ggctatcatc acaagagcgg aacggactca 144 0 
agacatatta tctgccacgg aggtgttatt 1500 
cagctgatcg aagaggtact ggctgataat 1560 
acccttcacg aactgtatga tttagacgtg 1620 
gtttcgcaga tttttcccga ctctgtaatg 1680 
acttttccgc cggcgcccgg ttctccggag 174 0 
ccggagcaga gagccttggg tccggtttct 1800 
cttacctgcc acgaggctgg ctttccaccc 18 60 
tttgtgttag attatgtgga gcaccccggg 1920 
aggaatacgg gggacccaga tattatgtgt 1980 
tttgtctaca gtaagtgaaa attatgggca 2040 
taattttttt tttaattttt acagttttgt 2100 
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ggtttaaaga attttgtatt gtgatttttt 
agcccgagcc agaaccggag cctgcaagac 
tcctgagacg cccgacatca cctgtgtcta 
actccggtcc ttctaacaca cctcctgaga 
aaccagttgc cgtgagagtt ggtgggcgtc 
ttaacgagcc tgggcaacct ttggacttga 
acctgtgatt gcgtgtgtgg ttaacgcctt 
taaagggtga gataatgttt aacttgcatg 
atataatgcg ccgtgggcta atcttggtta 
tggaagattt ttctgctgtg cgtaacttgc 
tttggaggtt tctgtggggc tcatcccagg 
acaagtggga atttgaagag cttttgaaat 
tgggtcacca ggcgcttttc caagagaagg 
ggcgcgctgc ggctgctgtt gcttttttga 
cccatctgag cggggggtac ctgctggatt 
tgagacacaa gaatcgcctg ctactgttgt 
aggagcagca gcagcagcag gaggaagcca 
acccgagagc cggcctggac cctcgggaat 
agaactgaga cgcattttga caattacaga 
ggagcggggg gcttgtgagg ctacagagga 
cagacaccgt cctgagtgta ttacttttca 
tgatctgctg gcgcagaagt attccataga 
ggatgatttt gaggaggcta ttagggtata 
gtacaagatc agcaaacttg taaatatcag 
cgaggtggag atagatacgg aggatagggt 
gccgggggtg cttggcatgg acggggtggt 
ttttagcggt acggttttcc tggccaatac 
tgggtttaac aatacctgtg tggaagcctg 
ttactgctgc tggaaggggg tggtgtgtcg 
cctctttgaa aggtgtacct tgggtatcct 
tgtggcctcc gactgtggtt gcttcatgct 
catggtatgt ggcaactgcg aggacagggc 
ctgtcacctg ctgaagacca ttcacgtagc 
tgagcataac atactgaccc gctgttcctt 
accttaccaa tgcaatttga gtcacactaa 
ggtgaacctg aacggggtgt ttgacatgac 
tgagacccgc accaggtgca gaccctgcga 
tgtgatgctg gatgtgaccg aggagctgag 
cgctgagttt ggctctagcg atgaagatac 
cttaagggtg ggaaagaata tataaggtgg 
agcagccgcc gccgccatga gcaccaactc 
gacaacgcgc atgcccccat gggccggggt 
tggtcgcccc gtcctgcccg caaactctac 
gccgttggag actgcagcct ccgccgccgc 
tgtgactgac tttgctttcc tgagcccgct 
ccgcgatgac aagttgacgg ctcttttggc 
tgtcgtttct cagcagctgt tggatctgcg 
ccctcccaat gcggtttaaa acataaataa 
gcaagtgtct tgctgtctct cgagggatct 
cataattgga caaactacct acagagattt 
gtgtataatg tgttaaacta ctgattctaa 
gaactgatga atgggagcag tggtggaatg 
aagaaatgcc atctagtgat gatgaggcta 
aaaagaagag aaaggtagaa gaccccaagg 
gtcatgctgt gtttagtaat agaactcttg 
aagctgcact gctatacaag aaaattatgg 
ataacagtta taatcataac atactgtttt 
ctattaataa ctatgctcaa aaattgtgta 
ataaggaata tttgatgtat agtgccttga 
gtagaggttt tacttgcttt aaaaaacctc 
atgaatgcaa ttgttgttgt taacttgttt 
aatagcatca caaatttcac aaataaagca 



taaaaggtcc tgtgtctgaa cctgagcctg 2160 
ctacccgccg tcctaaaatg gcgcctgcta 2220 
gagaatgcaa tagtagtacg gatagctgtg 2280 
tacacccggt ggtcccgctg tgccccatta 2340 
gccaggctgt ggaatgtatc gaggacttgc 2400 
gctgtaaacg ccccaggcca taaggtgtaa 2460 
tgtttgctga atgagttgat gtaagtttaa 2520 
gcgtgttaaa tggggcgggg cttaaagggt 2580 
catctgacct catggaggct tgggagtgtt 264 0 
tggaacagag ctctaacagt acctcttggt 2700 
caaagttagt ctgcagaatt aaggaggatt 2760 
cctgtggtga gctgtttgat tctttgaatc 2820 
tcatcaagac tttggatttt tccacaccgg 2880 
gttttataaa ggataaatgg agcgaagaaa 2940 
ttctggccat gcatctgtgg agagcggttg 3000 
cttccgtccg cccggcgata ataccgacgg 3060 
ggcggcggcg gcaggagcag agcccatgga 3120 
gaatgttgta caggtggctg aactgtatcc 3180 
ggatgggcag gggctaaagg gggtaaagag 3240 
ggctaggaat ctagctttta gcttaatgac 3300 
acagatcaag gataattgcg ctaatgagct 3360 
gcagctgacc acttactggc £gcagccagg 3420 
tgcaaaggtg gcacttaggc cagattgcaa 3480 
gaattgttgc tacatttctg ggaacggggc 3540 
ggcctttaga tgtagcatga taaatatgtg 3600 
tattatgaat gtaaggttta ctggccccaa 3660 
caaccttatc ctacacggtg taagcttcta 3720 
gaccgatgta agggttcggg gctgtgcctt 3780 
ccccaaaagc agggcttcaa ttaagaaatg 3840 
gtctgagggt aactccaggg tgcgccacaa 3900 
agtgaaaagc gtggctgtga ttaagcataa 3960 
ctctcagatg ctgacctgct cggacggcaa 4020 
cagccactct cgcaaggcct ggccagtgtt 4080 
gcatttgggt aacaggaggg gggtgttcct 4140 
gatattgctt gagcccgaga gcatgtccaa 4200 
catgaagatc tggaaggtgc tgaggtacga 4260 
gtgtggcggt aaacatatta ggaaccagcc 4320 
gcccgatcac ttggtgctgg cctgcacccg 4380 
agattgaggt actgaaatgt gtgggcgtgg 4440 
gggtcttatg tagttttgta tctgttttgc 4500 
gtttgatgga agcattgtga gctcatattt 4560 
gcgtcagaat gtgatgggct ccagcattga 4 620 
taccttgacc tacgagaccg tgtctggaac 4 680 
ttcagccgct gcagccaccg cccgcgggat 4740 
tgcaagcagt gcagcttccc gttcatccgc 4 800 
acaattggat tctttgaccc gggaacttaa 4860 
ccagcaggtt tctgccctga aggcttcctc 4 920 
aaaaccagac tctgtttgga tttggatcaa 4980 
ttgtgaagga accttacttc tgtggtgtga 5040 
aaagctctaa ggtaaatata aaatttttaa 5100 
ttgtttgtgt attttagatt ccaacctatg 5160 
cctttaatga ggaaaacctg ttttgctcag 5220 
ctgctgactc tcaacattct actcctccaa 5280 
actttccttc agaattgcta agttttttga 5340 
cttgctttgc tatttacacc acaaaggaaa 5400 
aaaaatattc tgtaaccttt ataagtaggc 5460_ 
ttcttactcc acacaggcat agagtgtctg 5520 
cctttagctt tttaatttgt aaaggggtta 5580 
ctagagatca taatcagcca taccacattt 564 0 
ccacacctcc ccctgaacct gaaacataaa 5700 
attgcagctt ataatggtta caaataaagc 5760 
tttttttcac tgcattctag ttgtggtttg 5820 
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tccaaactca tcaatgtatc 
gggtgtggaa agtccccagg 
tagtcagcaa ccaggtgtgg 
atgcatctca attagtcagc 
actccgccca gttccgccca 
gaggccgagg ccgcctcggc 
ggcctaggct tttgcaaaaa 
accactttat cccgcgtcag 
actggttttt agtgcgccag 
tttttaagcc gtagataaac 
ctgggacatg ttgcagatcc 
atggaaaggc attattgccg 
tgaactgggt attcgtcatg 
gcgcgagctt aaagtgctga 
tgacctggtg gataccggtg 
ctttgtcacc atcttcgcaa 
tatcccgcaa gatacctgga 
aatctccggt cgctaatctt 
caggcgggtt acaatagttt 
acctgagcga aaccctgttc 
gccgctttaa tcacggcgca 
ctcactggta tcgcatgatt 
aaatcctcga cgtccaggca 
tatacgatac ggtgattggc 
tttgtgaagg aaccttactt 
taaagctcta aggtaaatat 
attgtttgtg tattttagat 
gcctttaatg aggaaaacct 
actgctgact ctcaacattc 
gactttcctt cagaattgct 
gcttgctttg ctatttacac 
gaaaaatatt ctgtaacctt 
tttcttactc cacacaggca 
acctttagct ttttaatttg 
actagagatc ataatcagcc 
cccacacctc cccctgaacc 
tattgcagct tataatggtt 
atttttttca ctgcattcta 
ctggatcccc aggaagctcc 
acattccaat cataggctgc 
acaaaaagga aattgggtag 
atctgggaag tcccttccac 
caacagcaga aacatacaag 
gaagcactgt ggttgctgtg 
tgtaggttcc aaaatatcta 
ctggataagc attatcctta 
caactgtagc attttttggg 
acacaccctg cagctccaaa 
gaatgggttt tccagcacca 
tagcagttac cccaataacc 
cacaggttaa gtcctcattt 
gatacgccta tttttatagg 
cacttttcgg ggaaatgtgc 
tatgtatccg ctcatgagac 
gagtatgagt attcaacatt 
tcctgttttt gctcacccag 
tgcacgagtg ggttacatcg 
ccccgaagaa cgttttccaa 
atcccgtgtt gacgccgggc 
cttggttgag tactcaccag 
attatgcagt gctgccataa 
gatcggagga ccgaaggagc 



ttatcatgtc tggatccggc 
ctccccagca ggcagaagta 
aaagtcccca ggctccccag 
aaccatagtc ccgcccctaa 
ttctccgccc catggctgac 
ctctgagcta ttccagaagt 
gcttggacac aagacaggct 
ggagaggcag tgcgtaaaaa 
atctctataa tctcgcgcaa 
aggctgggac acttcacatg 
atgcacgtaa actcgcaagc 
taagccgtgg cggtctggta 
tcgataccgt ttgtatttcc 
aacgcgcaga aggcgatggc 
gtactgcggt tgcgattcgt 
aaccggctgg tcgtccgctg 
ttgaacagcc gtgggatatg 
ttcaacgcct ggcactgccg 
ccagtaagta ttctggaggc 
aaaccccgct ttaaacatcc 
caaccgcctg tgcagtcggc 
aaccgtctga tgtggatctg 
cgtattgtga tgagcgatgc 
taccgtggcg gcaactggat 
ctgtggtgtg acataattgg 
aaaattttta agtgtataat 
tccaacctat ggaactgatg 
gttttgctca gaagaaatgc 
tactcctcca aaaaagaaga 
aagttttttg agtcatgctg 
cacaaaggaa aaagctgcac 
tataagtagg cataacagtt 
tagagtgtct gctattaata 
taaaggggtt aataaggaat 
ataccacatt tgtagaggtt 
tgaaacataa aatgaatgca 
acaaataaag caatagcatc 
gttgtggttt gtccaaactc 
tctgtgtcct cataaaccct 
ccatccaccc tctgtgtcct 
gggtttttca cagaccgctt 
tgctgtgttc cagaagtgtt 
ctgtcagctt tgcacaaggg 
ttagtaatgt gcaaaacagg 
gtgttttcat ttttacttgg 
tccaaaacag ccttgtggtc 
gttacagttt gagcaggata 
ggttccccac caacagcaaa 
ttttcatgag ttttttgtgt 
tcagttttaa cagtaacagc 
aaattaggca aaggaattct 
ttaatgtcat gataataatg 
gcggaacccc tatttgttta 
aataaccctg ataaatgctt 
tccgtgtcgc ccttattccc 
aaacgctggt gaaagtaaaa 
aactggatct caacagcggt 
tgatgagcac ttttaaagtt 
aagagcaact cggtcgccgc 
tcacagaaaa gcatcttacg 
ccatgagtga taacactgcg 
taaccgcttt tttgcacaac 



tgtggaatgt gtgtcagtta 58 8 0 
tgcaaagcat gcatctcaat 594 0 
caggcagaag tatgcaaagc 6000 
ctccgcccat cccgccccta 60 60 
taattttttt tatttatgca 6120 
agtgaggagg cttttttgga 618 0 
tgcgagatat gtttgagaat 624 0 
gacgcggact catgtgaaat 6300 
cctattttcc cctcgaacac 6360 
agcgaaaaat acatcgtcac 6420 
cgactgatgc cttctgaaca 64 80 
ccgggtgcgt tactggcgcg 654 0 
agctacgatc acgacaacca 6600 
gaaggcttca tcgttattga 6660 
gaaatgtatc caaaagcgca 6720 
gttgatgact atgttgttga 6780 
ggcgtcgtat tcgtcccgcc 6840 
ggcgttgttc tttttaactt 6900 
tgcatccatg acacaggcaa 69 60 
tgaaacctcg acgctagtcc 7 020 
ccttgatggt aaaaccatcc 7080 
gcgcggcatt cpacccacgcg 714 0 
cgaacgtacc gacgatgatt 7200 
ttatgagtgg gccccggatc 7260 
acaaactacc tacagagatt 7320 
gtgttaaact actgattcta 7380 
aatgggagca gtggtggaat 74 40 
catctagtga tgatgaggct 7500 
gaaaggtaga agaccccaag 7 5 60 
tgtttagtaa tagaactctt 7620 
tgctatacaa gaaaattatg 7680 
ataatcataa catactgttt 7740 
actatgctca aaaattgtgt 7800 
atttgatgta tagtgccttg 78 60 
ttacttgctt taaaaaacct 7920 
attgttgttg ttaacttgtt 7980 
acaaatttca caaataaagc 8040 
atcaatgtat cttatcatgt 8100 
aacctcctct acttgagagg 8160 
cctgttaatt aggtcactta 8220 
tctaagggta attttaaaat 8280 
ggtaaacagc ccacaaatgt 8340 
cccaacaccc tgctcatcaa 84 00 
aggcacattt tccccacctg 84 60 
atcaggaacc cagcactcca 8520 
agtgttcatc tgctgactgt 8580 
tttggtcctg tagtttgcta 8 64 0 
aaaatgaaaa tttgaccctt 8700 
ccctgaatgc aagtttaaca 87 60 
ttcccacatc aaaatatttc 8820 
tgaagacgaa agggcctcgt 8880 
gtttcttaga cgtcaggtgg 8940 
tttttctaaa tacattcaaa 9000 
caataatatt gaaaaaggaa 9060 
ttttttgcgg cattttgcct 9120 
gatgctgaag atcagttggg 9180 
aagatccttg agagttttcg 9240 
ctgctatgtg gcgcggtatt 9300 
atacactatt ctcagaatga 9360 
gatggcatga cagtaagaga 9420 
gccaacttac ttctgacaac 9480 
atgggggatc atgtaactcg 9540 



ccttgatcgt tgggaaccgg agctgaatga 
gatgcctgca gcaatggcaa caacgttgcg 
agcttcccgg caacaattaa tagactggat 
gcgctcggcc cttccggctg gctggtttat 
gtctcgcggt atcattgcag cactggggcc 
ctacacgacg gggagtcagg caactatgga 
tgcctcactg attaagcatt ggtaactgtc 
tgatttaaaa cttcattttt aatttaaaag 
catgaccaaa atcccttaac gtgagttttc 
gatcaaagga tcttcttgag atcctttttt 
aaaaccaccg ctaccagcgg tggtttgttt 
gaaggtaact ggcttcagca gagcgcagat 
gttaggccac cacttcaaga actctgtagc 
gttaccagtg gctgctgcca gtggcgataa 
atagttaccg gataaggcgc agcggtcggg 
cttggagcga acgacctaca ccgaactgag 
cacgcttccc gaagggagaa aggcggacag 
agagcgcacg agggagcttc cagggggaaa 
tcgccacctc tgacttgagc gtcgattttt 
gaaaaacgcc agcaacgcgg cctttttacg 
catgttcttt cctgcgttat cccctgattc 
agctgatacc gctcgccgca gccgaacgac 
ggaagagcgc ctgatgcggt attttctcct 
accgcctcag aagccataga gcccaccgca 
tcctccccct tgctgtcctg ccccacccca 
caatgcgatg caatttcctc attttattag 
gtcaaggaag gcacggggga ggggcaaaca 
gaggctgatc agcgagctct agcatttagg 
atgctcgagc ggccgcttct ttattcttgg 
gcaaatattt cattaatgta gttgtggcca 
gcacttggag ttgtgtctcc tgtttcctgt 
ggtttagttt tgtctccgtt taagtaaact 
ccgtgagatt ttggataagc tgataggtta 
gtgccttcag taagatctcc atttctaaag 
tttagtagca ctccattttc gtcaaatctt 
gatattggag ccaaactgcc tttaacagcc 
ccacattttg ttaagaccaa agtgagttta 
ggagatggag ctggtgtggt ccacaaagtt 
atggcacctg tgctgtcaaa actaaggcca 
gaatcaaatt ctaggccatg gccaattttt 
ggtgaaccaa attcaagccc atctcctgca 
aaccccttgg cagtgcttag gttaacctca 
aagtaaaggc ctttgttgta gttaatatcc 
ccctgtccta gtcttagatt tagttggttt 
ataaggcgtc tgttttgaga atcaatcctt 
ccttgtgaat caaaacccaa ggctccagta 
gtcacacctg gaccagttgc tacggtcaaa 
gccccgtact ttagtcctag ttttccattt 
cccaagctac cagtggcagt agttagaggg 
tcggtggtgg tgagggggcc tgatgtttgc 
ggtccttggg tggcaatgct aagtttggag 
atggtgagtg tgttgcccgc gaccattaga 
gaggtaactg tgaggggtgc agatatttcc 
ggtgggctca cagtggttac attttgggag 
ccgttgccca ttttgagcgc aagcatgcca 
aaagagagta ccccaggggg actctcttga 
agaaaaggca cagttggagg accggtttcc 
tcttcagacg gtcttgcgcg cttcatcttg 
catccgccgt ctcaagaccg cctactttaa 
caaccccgac cgccacccgc tgccgcccgc 
ctggttagac gcctttctcg agaggttttc 
cctcggtggc ggagtaccgt tcggaggccg 
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agccatacca aacgacgagc gtgacaccac 9600 

caaactatta actggcgaac tacttactct 9660 

ggaggcggat aaagttgcag gaccacttct 97 20 

tgctgataaa tctggagccg gtgagcgtgg 97 80 

agatggtaag ccctcccgta tcgtagttat 9840 

tgaacgaaat agacagatcg ctgagatagg 9900 

agaccaagtt tact cat ata tactttagat 9960 

gatctaggtg aagatccttt ttgataatct 10020 

gttccactga gcgtcagacc ccgtagaaaa 1008 0 

tctgcgcgta atctgctgct tgcaaacaaa 1014 0 

gccggatcaa gagctaccaa ctctttttcc 10200 

accaaatact gtccttctag tgtagccgta 10260 

accgcctaca tacctcgctc tgctaatcct 10320 

gtcgtgtctt accgggttgg actcaagacg 10380 

ctgaacgggg ggttcgtgca cacagcccag 104 4 0 

atacctacag cgtgagctat gagaaagcgc 10500 

gtatccggta agcggcaggg tcggaacagg 10560 

cgcctggtat ctttatagtc ctgtcgggtt 10620 

gtgatgctcg tcaggggggc ggagcctatg 10680 

gttcctggcc ttttgctggc cttttgctca 10740 

tgtggataac cgtattaccg cctttgagtg 10800 

cgagcgcagc gagtcagtga ejcgaggaagc 108 60 

tacgcatctg tgcggtattt cacaccgcat 10920 

tccccagcat gcctgctatt gtcttcccaa 10980 

ccccccagaa tagaatgaca cctactcaga 1104 0 

gaaaggacag tgggagtggc accttccagg 1110 0 

acagatggct ggcaactaga aggcacagtc 11160 

tgacactata gaatagggcc ctctagatgc 11220 

gcaatgtatg aaaaagtgta agaggatgtg 11280 

gaccagtccc atgaaaatga catagagtat 1134 0 

gtaccgttta gtgtaatggt tagtgttaca 11400 

tgactgacaa tgttactttt ggcagtttta 114 60 

ggcataaatc caacagcgtt tgtataggct 11520 

ttccaatatt ctgggtccag gaaggaattg 11580 

ataataagat gagcactttg aactgttcca 1164 0 

aaaactgaaa ctgtagcaag tatttgactg 11700 

gcatctttct ctgcatttag tctacagtta 11760 

agcttatcat tatttttgtt tcctactgta 11820 

gttcctagtt taggaaccat agccttgttt 11880 

gttttgaggg gatttgtgtt tggtgcatta 11940 

ttaatggcta tggctgtagc gtcaaacatc 12000 

agctttttgg aattgtttga agctgtaaac 12060 

aagttgtggg ctgagtttat aaaaagaggg 12120 

tgagcatcaa acggataact aacatcaagt 1218 0 

agtcctcctg ctacattaag ttgcatattg 12240 

actttagttt gcaaggaagt attattaata 12300 

gtgtttaggt cgtctgttac atgcaaagga 12360 

tgtgtataaa tgggctcttt caagtcaatg 12420 

ggtgaggcag tgatagtaag ggtactgcta 12480 

agggctagct ttccttctga cactgtgagg 12540 

tcgtgcacgg ttagcggggc ctgtgattgc 12600 

ggtgcggcgg cagccacagt tagggcttct 12660 

aggtttatgt ttgacttggt ttttttgaga 12720 

gtaaggttgc cggcctcgtc cagagagagg 12780 

ttggaggtaa ctagaggttc ggataggcgc 1284 0 

aacccattgg gggatacaaa gggaggagta 12900 

gtgtcatatg gatacacggg gttgaaggta 12960 

gatctcaagc ctgccacacc tcacctcgac 13020 

ttacatcatc agcagcacct ccgccagaaa 13080 

cacggtgctc agcctacctt gcgactgtga 13140 

cgatccggtc gatgcggact cgctcaggtc 13200 

acgggtttcc gatccaagag tactggaaag 13260 
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accgcgaaga 
tcggtaccaa 
agtgggttct 
accgcccatt 
ttggtgccaa 
gtcaaaccgc 
gatgactaat 
ataatgccag 
atacacttga 
atggaaagtc 
ggggtcgttg 
catatatggg 
tcaatgtcaa 
aagcagattc 
cgcgcactac 
ggcatcagag 
tctatggtgc 
ccgctatcgc 
gcgccctgac 
gggagctgca 



gtttgtcctc 
gcttgggtct 
ctagttagcc 
tgcgtcaatg 
aacaaactcc 
tatccacgcc 
acgtagatgt 
gcgggccatt 
tgtactgcca 
cctattggcg 
ggcggtcagc 
ctatgaacta 
cgcgtatatc 
ttcatgcaat 
tcagcgacct 
cagattgtac 
actctcagta 
tacgtgactg 
gggcttgtct 
tgtgtcagag 



aaccgcgagc 
ccctatagtg 
agagagctct 
gggcggagtt 
cattgacgtc 
cattgatgta 
actgccaagt 
taccgtcatt 
agtgggcagt 
ttactatggg 
caggcgggcc 
atgaccccgt 
tggcccgtac 
tgtcggtcaa 
ccaacacaca 
tgagagtcga 
caatctgctc 
ggtcatggct 
gctcccggca 
gttttcaccg 



ccaacagcga 
agtcgtatta 
gcttatatag 
gttacgacat 
aatggggtgg 
ctgccaaaac 
aggaaagtcc 
gacgtcaata 
ttaccgtaaa 
aacatacgtc 
atttaccgta 
aattgattac 
atcgcgaagc 
gccttgcctt 
agcagggagc 
ccatagggga 
tgatgccgca 
gcgccccgac 
tccgcttaca 
tcatcaccga 



gctcgaattc 
atttcgataa 
acctcccacc 
tttggaaagt 
agacttggaa 
cgcatcacca 
cataaggtca 
gggggcgtac 
tagtccaccc 
attattgacg 
agttatgtaa 
tattaataac 
agcgcaaaac 
gttgtagctt 
agatactggc 
tcgggagatc 
tagttaagcc 
acccgccaac 
gacaagctgt 
aacgcgcgag 



agatccgagc 
gccagtaagc 
gtacacgcct 
cccgttgatt 
atccccgtga 
tggtaatagc 
tgtactgggc 
ttggcatatg 
attgacgtca 
tcaatgggcg 
cgcggaactc 
tagtcaataa 
gcctaaccct 
aaattttgct 
ttaactatgc 
tcccgatccg 
agtatacact 
acccgctgac 
gaccgtctcc 
gcagc 



13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14455 



<210> 16 
<211> 10610 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: plasmid 



<400> 16 

gacggatcgg 

tctatgtcat 

tcctcttaca 

cgtgtttatt 

accaccacat 

caacctgcca 

aaaaagcatc 

ctgtcgagcc 

catgtcgctg 

cggcgaagga 

gcggtggtgc 

atacaacatg 

tgtcctccgg 

cagcaccaca 

ggggaccaca 

cctcataaac 

ccggtaccat 

ggccaaaacc 

gagagcccag 

acacaggcac 

atcccaggga 

cacgtaactc 

cagtatggta 

gcgccgagac 

agtcatattt 

ctcgccgctt 

ggcgccccct 

ccaccaccgc 

cgggaggagc 

aaaacctcaa 

tctacagcca 



gagatccgcg 
tttcatggga 
ctttttcata 
tttcaattgc 
agcttataca 
cctccctccc 
atatcatggg 
aaacgctcat 
tccagctgct 
gaagtccacg 
tgcagcagcg 
gcagtggtct 
gcacagcagc 
atattgttca 
gaacccacgt 
acgctggaca 
ataaacctct 
tgcccgccgg 
gactcgtaac 
acgtgcatac 
acaacccatt 
acgttgtgca 
gcgcgggttt 
aaccgagatc 
cctgaagcaa 
agatcgctct 
ggcttcgggt 
agaataagcc 
gggaagagct 
aatgaagatc 
aagaacagat 



cggtacacag 
ctggtctggc 
cattgcccaa 
agaaaatttc 
gatcaccgta 
aacacacaga 
taacagacat 
cagtgatatt 
gagccacagg 
cctacatggg 
cgcgaataaa 
cctcagcgat 
gcaccctgat 
aaatcccaca 
ggccatcata 
taaacattac 
gattaaacat 
ctatacactg 
catggatcat 
acttcctcag 
cctgaatcag 
ttgtcaaagt 
ctgtctcaaa 
gtgttggtcg 
aaccaggtgc 
gtgtagtagt 
tctatgtaaa 
acacccagcc 
ggaagaacca 
tattaagtga 
aatggcattt 



aattcaggag 
cacaactaca 
gaataaagaa 
aagtcatttt 
ccttaatcaa 
gtacacagtc 
attcttaggt 
aataaactcc 
ctgctgtcca 
ggtagagtca 
ctgctgccgc 
gattcgcacc 
ctcacttaaa 
gtgcaaggcg 
ccacaagcgc 
ctcttttggc 
ggcgccatcc 
cagggaaccg 
catgctcgtc 
gattacaagc 
cgtaaatccc 
gttacattcg 
aggaggtaga 
tagtgtcatg 
gggcgtgaca 
tgtagtatat 
ctccttcatg 
aacctacaca 
tgtttttttt 
acgcgctccc 
gtaagatgtt 



acacaactcc 
ttaatgaaat 
tcgtttgtgt 
tcattcagta 
actcacagaa 
ctttctcccc 
gttatattcc 
ccgggcagct 
acttgcggtt 
taatcgtgca 
cgccgctccg 
gcccgcagca 
tcagcacagt 
ctgtatccaa 
aggtagatta 
atgttgtaat 
accaccatcc 
ggactggaac 
atgatatcaa 
tcctcccgcg 
acactgcagg 
ggcagcagcg 
cgatccctac 
ccaaatggaa 
aacagatctg 
ccactctctc 
cgccgctgcc 
ttcgttctgc 
tttattccaa 
ctccggtggc 
gcacaatggc 



aagtgcatac 
atttgccaca 
tatgtttcaa 
gtatagcccc 
ccctagtatt 
ggctggcctt 
acacggtttc 
cacttaagtt 
gcttaacggg 
tcaggatagg 
tcctgcagga 
taaggcgcct 
aactgcagca 
agctcatggc 
agtggcgacc 
tcaccacctc 
taaaccagct 
aatgacagtg 
tgttggcaca 
ttagaaccat 
gaagacctcg 
gatgatcctc 
tgtacggagt 
cgccggacgt 
cgtctccggt 
aaagcatcca 
ctgataacat 
gagtcacaca 
aagattatcc 
gtggtcaaac 
ttccaaaagg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 
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caaacggccc tcacgtccaa gtggacgtaa 
ataaacattc cagcaccttc aaccatgccc 
atatctctaa gcaaatcccg aatattaagt 
ccctccacct tcagcctcaa gcagcgaatc 
cctgtataag attcaaaagc ggaacattaa 
gcagggccag ctgaacataa tcgtgcaggt 
caggaacctt gacaaaagaa cccacactga 
ccagcgtagc cccgatgtaa gctttgttgc 
ctcaaaaaat caggcaaagc ctcgcgcaaa 
agataaaggc aggtaagctc cggaaccacc 
atgtctgcgg gtttctgcat aaacacaaaa 
aagcctgtct tacaacagga aaaacaaccc 
cggcgtgacc gtaaaaaaac tggtcaccgt 
tcatgtccgg agtcataatg taagactcgg 
gctaaaaagc gaccgaaata gcccggggga 
acagccccca taggaggtat aacaaaatta 
aaaccctcct gcctaggcaa aatagcaccc 
cagcggcagc ctaacagtca gccttaccag 
ctcgacacgg caccagctca atcagtcaca 
atatatagga ctaaaaaatg acgtaacggt 
cacgcgaacc tacgcccaga aacgaaagcc 
ttccgttttc ccacgttacg taacttcccg 
tcagtacaat ctgctctgat gccgcatagt 
tggaggtcgc tgagtagtgc gcgagcaaaa 
acaattgcat gaagaatctg cttagggtta 
ccagatatac gcgttgacat tgattattga 
cattagttca tagcccatat atggagttcc 
ctggctgacc gcccaacgac ccccgcccat 
taacgccaat agggactttc cattgacgtc 
acttggcagt acatcaagtg tatcatatgc 
gtaaatggcc cgcctggcat tatgcccagt 
agtacatcta cgtattagtc atcgctatta 
atgggcgtgg atagcggttt gactcacggg 
atgggagttt gttttggcac caaaatcaac 
ccccattgac gcaaatgggc ggtaggcgtg 
tctggctaac tagagaaccc actgcttact 
gggagaccca agcttggtac cgagctcgga 
gttgaggaca aactcttcgc ggtctttcca 
cgaacggtac tccgccaccg agggacctga 
tctcgagaaa ggcgtctaac cagtcacagt 
gcagcgggtg gcggtcgggg ttgtttctgg 
aggcggtctt gagacggcgg atggtcgagg 
aagcgcgcaa gaccgtctga agataccttc 
ggtcctccaa ctgtgccttt tcttactcct 
agtccccctg gggtactctc tttgcgccta 
cttgcgctca aaatgggcaa cggcctctct 
aatgtaacca ctgtgagccc acctctcaaa 
tctgcacccc tcacagttac ctcagaagcc 
gtcgcgggca acacactcac catgcaatca 
cttagcattg ccacccaagg acccctcaca 
tcaggccccc tcaccaccac cgatagcagt 
actactgcca ctggtagctt gggcattgac 
aaactaggac taaagtacgg ggctcctttg 
gtagcaactg gtccaggtgt gactattaat 
gccttgggtt ttgattcaca aggcaatatg 
gattctcaaa acagacgcct tatacttgat 
ctaaatctaa gactaggaca gggccctctt 
aactacaaca aaggccttta cttgtttaca 
aacctaagca ctgccaaggg gttgatgttt 
gatgggcttg aatttggttc acctaatgca 
ggccatggcc tagaatttga ttcaaacaag 
agttttgaca gcacaggtgc cattacagta 



aggctaaacc cttcagggtg aatctcctct 1920 
aaataattct catctcgcca ccttctcaat 1980 
ccggccattg taaaaatctg ctccagagcg 2040 
atgattgcaa aaattcaggt tcctcacaga 2100 
caaaaatacc gcgatcccgt aggtcccttc 2160 
ctgcacggac cagcgcggcc acttccccgc 2220 
ttatgacacg catactcgga gctatgctaa 2280 
atgggcggcg atataaaatg caaggtgctg 234 0 
aaagaaagca catcgtagtc atgctcatgc 2400 
acagaaaaag acaccatttt tctctcaaac 24 60 
taaaataaca aaaaaacatt taaacattag 2520 
ttataagcat aagacggact acggccatgc 2580 
gattaaaaag caccaccgac agctcctcgg 2640 
taaacacatc aggttgattc atcggtcagt 2700 
atacataccc gcaggcgtag agacaacatt 27 60 
ataggagaga aaaacacata aacacctgaa 2820 
tcccgctcca gaacaacata cagcgcttca 2880 
taaaaaagaa aacctattaa aaaaacacca 2940 
gtgtaaaaaa gggccaagtg cagagcgagt 3000 
taaagtccac aaaaaacacc cagaaaaccg 3060 
aaaaaaccca caacttcctc aaatcgtcac 3120 
gatcctctcc cgatccccta tggtcgactc 3180 
taagccagta tctgctccct gcttgtgtgt 324 0 
tttaagctac aacaaggcaa ggcttgaccg 3300 
ggcgttttgc gctgcttcgc gatgtacggg 3360 
ctagttatta atagtaatca attacggggt 3420 
gcgttacata acttacggta aatggcccgc 3480 
tgacgtcaat aatgacgtat gttcccatag 3540 
aatgggtgga ctatttacgg taaactgccc 3600 
caagtacgcc ccctattgac gtcaatgacg 3660 
acatgacctt atgggacttt cctacttggc 3720 
ccatggtgat gcggttttgg cagtacatca 3780 
gatttccaag tctccacccc attgacgtca 3840 
gggactttcc aaaatgtcgt aacaactccg 3900 
tacggtggga ggtctatata agcagagctc 3960 
ggcttatcga aattaatacg actcactata 4020 
tctgaattcg agctcgctgt tgggctcgcg 4080 
gtactcttgg atcggaaacc cgtcggcctc 4140 
gcgagtccgc atcgaccgga tcggaaaacc 4200 
cgcaaggtag gctgagcacc gtggcgggcg 4260 
cggaggtgct gctgatgatg taattaaagt 4320 
tgaggtgtgg caggcttgag atccaagatg 4380 
aaccccgtgt atccatatga cacggaaacc 4 4 40 
ccctttgtat cccccaatgg gtttcaagag 4500 
tccgaacctc tagttacctc caatggcatg 4560 
ctggacgagg ccggcaacct tacctcccaa 4 620 
aaaaccaagt caaacataaa cctggaaata 4 680 
ctaactgtgg ctgccgccgc acctctaatg 4740 
caggccccgc taaccgtgca cgactccaaa 4800 
gtgtcagaag gaaagctagc cctgcaaaca 4 8 60 
acccttacta tcactgcctc accccctcta 4 920 
ttgaaagagc ccatttatac acaaaatgga 4 980 
catgtaacag acgacctaaa cactttgacc 5040 
aatacttcct tgcaaactaa agttactgga 5100 
caacttaatg tagcaggagg actaaggatt 5160 
gttagttatc cgtttgatgc tcaaaaccaa 5220 
tttataaact cagcccacaa cttggatatt 5280 
gcttcaaaca attccaaaaa gcttgaggtt 5340 
gacgctacag ccatagccat taatgcagga 5400 
ccaaacacaa atcccctcaa aacaaaaatt 5460 
gctatggttc ctaaactagg aactggcctt 5520 
ggaaacaaaa ataatgataa gctaactttg 5580 



tggaccacac cagctccatc tcctaactgt 
actttggtct taacaaaatg tggcagtcaa 
aaaggcagtt tggctccaat atctggaaca 
gacgaaaatg gagtgctact aaacaattcc 
aatggagatc ttactgaagg cacagcctat 
tcagcttatc caaaatctca cggtaaaact 
ttaaacggag acaaaactaa acctgtaaca 
acaggagaca caactccaag tgcatactct 
aactacatta atgaaatatt tgccacatcc 
taaagaagcg gccgctcgag catgcatcta 
gctagagctc gctgatcagc ctcgactgtg 
ccctcccccg tgccttcctt gaccctggaa 
aatgaggaaa -ttgcatcgca ttgtctgagt 
gggcaggaca gcaaggggga ggattgggaa 
ggctctatgg cttctgaggc ggaaagaacc 
ccctgtagcg gcgcattaag cgcggcgggt 
cttgccagcg ccctagcgcc cgctcctttc 
gccggctttc cccgtcaagc tctaaatcgg 
ttacggcacc tcgaccccaa aaaacttgat 
ccctgataga cggtttttcg ccctttgacg 
ttgttccaaa ctggaacaac actcaaccct 
attttgggga tttcggccta ttggttaaaa 
aattaattct gtggaatgtg tgtcagttag 
ggcagaagta tgcaaagcat gcatctcaat 
ggctccccag caggcagaag tatgcaaagc 
ccgcccctaa ctccgcccat cccgccccta 
catggctgac taattttttt tatttatgca 
ttccagaagt agtgaggagg cttttttgga 
gcttgtatat ccattttcgg atctgatcaa 
gaacaagatg gattgcacgc aggttctccg 
gactgggcac aacagacaat cggctgctct 
gggcgcccgg ttctttttgt caagaccgac 
gaggcagcgc ggctatcgtg gctggccacg 
gttgtcactg aagcgggaag ggactggctg 
ctgtcatctc accttgctcc tgccgagaaa 
ctgcatacgc ttgatccggc tacctgccca 
cgagcacgta ctcggatgga agccggtctt 
caggggctcg cgccagccga actgttcgcc 
gatctcgtcg tgacccatgg cgatgcctgc 
ttttctggat tcatcgactg tggccggctg 
ttggctaccc gtgatattgc tgaagagctt 
ctttacggta tcgccgctcc cgattcgcag 
ttcttctgag cgggactctg gggttcgaaa 
cacgagattt cgattccacc gccgccttct 
gggacgccgg ctggatgatc ctccagcgcg 
ccaacttgtt tattgcagct tataatggtt 
caaataaagc atttttttca ctgcattcta 
cttatcatgt ctgtataccg tcgacctcta 
tgtttcctgt gtgaaattgt tatccgctca 
taaagtgtaa agcctggggt gcctaatgag 
cactgcccgc tttccagtcg ggaaacctgt 
gcgcggggag aggcggtttg cgtattgggc 
tgcgctcggt cgttcggctg cggcgagcgg 
tatccacaga atcaggggat aacgcaggaa 
ccaggaaccg taaaaaggcc gcgttgctgg 
agcatcacaa aaatcgacgc tcaagtcaga 
accaggcgtt tccccctgga agctccctcg 
ccggatacct gtccgccttt ctcccttcgg 
gtaggtatct cagttcggtg taggtcgttc 
ccgttcagcc cgaccgctgc gccttatccg 
gacacgactt atcgccactg gcagcagcca 
taggcggtgc tacagagttc ttgaagtggt 
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agactaaatg cagagaaaga tgctaaactc 5640 
atacttgcta cagtttcagt tttggctgtt 5700 
gttcaaagtg ctcatcttat tataagattt 57 60 
ttcctggacc cagaatattg gaactttaga 5820 
acaaacgctg ttggatttat gcctaaccta 588 0 
gccaaaagta acattgtcag tcaagtttac 5940 
ctaaccatta cactaaacgg tacacaggaa 6000 
atgtcatttt catgggactg gtctggccac 6060 
tcttacactt tttcatacat tgcccaagaa 6120 
gagggcccta ttctatagtg tcacctaaat 618 0 
ccttctagtt gccagccatc tgttgtttgc 624 0 
ggtgccactc ccactgtcct ttcctaataa 6300 
aggtgtcatt ctattctggg gggtggggtg 6360 
gacaatagca ggcatgctgg ggatgcggtg 6420 
agctggggct ctagggggta tccccacgcg 6480 
gtggtggtta cgcgcagcgt gaccgctaca 654 0 
gctttcttcc cttcctttct cgccacgttc 6600 
ggcatccctt tagggttccg atttagtgct 6660 
tagggtgatg gttcacgtag tgggccatcg 6720 
ttggagtcca cgttctttaa tagtggactc 6780 
atctcggtct attcttttga tttataaggg 6840 
aatgagctga tttaacaaaa atttaacgcg 6900 
ggtgtggaaa gtccccaggc tccccaggca 69 60 
tagtcagcaa ccaggtgtgg aaagtcccca 7 020 
atgcatctca attagtcagc aaccatagtc 7080 
actccgccca gttccgccca ttctccgccc 714 0 
gaggccgagg ccgcctctgc ctctgagcta 7 20 0 
ggcctaggct tttgcaaaaa gctcccggga 7 2 60 
gagacaggat gaggatcgtt tcgcatgatt 7 32 0 
gccgcttggg tggagaggct attcggctat 7380 
gatgccgccg tgttccggct gtcagcgcag 744 0 
ctgtccggtg ccctgaatga actgcaggac 7500 
acgggcgttc cttgcgcagc tgtgctcgac 7560 
ctattgggcg aagtgccggg gcaggatctc 7 62 0 
gtatccatca tggctgatgc aatgcggcgg 7 680 
ttcgaccacc aagcgaaaca tcgcatcgag 774 0 
gtcgatcagg atgatctgga cgaagagcat 7800 
aggctcaagg cgcgcatgcc cgacggcgag 7 8 60 
ttgccgaata tcatggtgga aaatggccgc 7 92 0 
ggtgtggcgg accgctatca ggacatagcg 7 98 0 
ggcggcgaat gggctgaccg cttcctcgtg 8 04 0 
cgcatcgcct tctatcgcct tcttgacgag 8100 
tgaccgacca agcgacgccc aacctgccat 8160 
atgaaaggtt gggcttcgga atcgttttcc 8220 
gggatctcat gctggagttc ttcgcccacc 828 0 
acaaataaag caatagcatc acaaatttca 834 0 
gttgtggttt gtccaaactc atcaatgtat 8400 
gctagagctt ggcgtaatca tggtcatagc 84 60 
caattccaca caacatacga gccggaagca 8520 
tgagctaact cacattaatt gcgttgcgct 858 0 
cgtgccagct gcattaatga atcggccaac 8 64 0 
gctcttccgc ttcctcgctc actgactcgc 8700 
tatcagctca ctcaaaggcg gtaatacggt 87 60 
agaacatgtg agcaaaaggc cagcaaaagg 8820 
cgtttttcca taggctccgc ccccctgacg 8880 
ggtggcgaaa cccgacagga ctataaagat 894 0_ 
tgcgctctcc tgttccgacc ctgccgctta 9000 
gaagcgtggc gctttctcaa tgctcacgct 9060 
gctccaagct gggctgtgtg cacgaacccc 9120 
gtaactatcg tcttgagtcc aacccggtaa 9180 
ctggtaacag gattagcaga gcgaggtatg 924 0 
ggcctaacta cggctacact agaaggacag 9300 
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tatttggtat 
gatccggcaa 
cgcgcagaaa 
agtggaacga 
cctagatcct 
cttggtctga 
ttcgttcatc 
taccatctgg 
tatcagcaat 
ccgcctccat 
atagtttgcg 
gtatggcttc 
tgtgcaaaaa 
cagtgttatc 
taagatgctt 
ggcgaccgag 
ctttaaaagt 
cgctgttgag 
ttactttcac 
gaataagggc 
gcatttatca 
aacaaatagg 



ctgcgctctg 
acaaaccacc 
aaaaggatct 
aaactcacgt 
tttaaattaa 
cagttaccaa 
catagttgcc 
ccccagtgct 
aaaccagcca 
ccagtctatt 
caacgttgtt 
attcagctcc 
agcggttagc 
actcatggtt 
ttctgtgact 
ttgctcttgc 
gctcatcatt 
atccagttcg 
cagcgtttct 
gacacggaaa 
gggttattgt 
ggttccgcgc 



ctgaagccag 
gctggtagcg 
caagaagatc 
taagggattt 
aaatgaagtt 
tgcttaatca 
tgactccccg 
gcaatgatac 
gccggaaggg 
aattgttgcc 
gccattgcta 
ggttcccaac 
tccttcggtc 
atggcagcac 
ggtgagtact 
ccggcgtcaa 
ggaaaacgtt 
atgtaaccca 
gggtgagcaa 
tgttgaatac 
ctcatgagcg 
acatttcccc 



ttaccttcgg 
gtggtttttt 
ctttgatctt 
tggtcatgag 
ttaaatcaat 
gtgaggcacc 
tcgtgtagat 
cgcgagaccc 
ccgagcgcag 
gggaagctag 
caggcatcgt 
gatcaaggcg 
ctccgatcgt 
tgcataattc 
caaccaagtc 
tacgggataa 
cttcggggcg 
ctcgtgcacc 
aaacaggaag 
tcatactctt 
gatacatatt 
gaaaagtgcc 



aaaaagagtt 
tgtttgcaag 
ttctacgggg 
attatcaaaa 
ctaaagtata 
tatctcagcg 
aactacgata 
acgctcaccg 
aagtggtcct 
agtaagtagt 
ggtgtcacgc 
agttacatga 
tgtcagaagt 
tcttactgtc 
attctgagaa 
taccgcgcca 
aaaactctca 
caactgatct 
gcaaaatgcc 
cctttttcaa 
tgaatgtatt 
acctgacgtc 



ggtagctctt 
cagcagatta 
tctgacgctc 
aggatcttca 
tatgagtaaa 
atctgtctat 
cgggagggct 
gctccagatt 
gcaactttat 
tcgccagtta 
tcgtcgtttg 
tcccccatgt 
aagttggccg 
atgccatccg 
tagtgtatgc 
catagcagaa 
aggatcttac 
tcagcatctt 
gcaaaaaagg 
tattattgaa 
tagaaaaata 
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<210> 17 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 17 

tgtacaccgg atccggcgca cacc 24 



<210> 18 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 18 

cacaacgagc tcaattaatt aattgccaca tcctc 35 



<210> 19 

<211> 4 

<212> PRT 

<213> adenovirus 

<400> 19 
Thr Leu Trp Thr 
1 



<210> 20 

<211> 12 

<212> PRT 

<213> adenovirus 
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<400> 20 

Pro Ser Ala Ser Ala Ser Ala Ser Ala Pro Gly Ser 
15 10 



<210> 21 

<211> 44 

<212> PRT 

<213> adenovirus 

<400> 21 

Met Lys Arg Ala Arg Pro Ser Glu 
1 5 

Tyr Asp Thr Glu Thr Gly Pro Pro 
20 

Phe Val Ser Pro Asn Gly Phe Gin 

35 40 



Asp Thr Phe Asn Pro Val Tyr Pro 
10 15 

Thr Val Pro Phe Leu Thr Pro Pro 
25 30 

Glu Ser Pro Pro 



<210> 22 

<211> 43 

<212> PRT 

<213> adenovirus 

<400> 22 

Met Ala Lys Arg Ala Arg Leu Ser Thr Ser Phe Asn Pro Val Tyr Pro 
15 10 15 

Tyr Glu Asp Glu Ser Ser Ser Gin His Pro Phe lie Asn Pro Gly Phe 
20 25 30 

lie Ser Pro Asp Gly Phe Thr Gin Ser Pro Asn 
35 40 



<210> 23 

<211> 43 

<212> PRT 

<213> adenovirus 

<400> 23 

Met Ser Lys Arg Leu Arg Val Glu Asp Asp Phe Asn Pro Val Tyr Pro 
15 10 15 

Tyr Gly Tyr Ala Arg Asn Gin Asn lie Pro Phe Leu Thr Pro Pro Phe 
20 25 30 

Val Ser Ser Asp Gly Phe Lys Asn Phe Pro Pro 
35 40 



<210> 24 

<211> 42 

<212> PRT 

<213> adenovirus 



<400> 24 

Met Lys Arg Ala Arg Phe Glu Asp Asp Phe Asn Pro Val Tyr Pro Tyr 
15 10 15 
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Glu His Tyr Asn Pro Leu Asp lie Pro Phe He Thr Pro Pro Phe Ala 
20 * 25 30 

Ser Ser Asn Gly Leu Gin Glu Lys Pro Pro 
35 40 



<210> 25 

<211> 42 

<212> PRT 

<213> adenovirus 



<400> 25 

Met Lys Arg Thr Arg He Glu Asp Asp Phe Asn Pro Val Tyr Pro Tyr 
15 10 15 

Asp Thr Ser Ser Thr Pro Ser He Pro Tyr Val Ala Pro Pro Phe Val 
20 25 30 



Ser Ser Asp Gly Leu Gin Glu Asn Pro Pro 
35 40 



<210> 26 
<211> 327 
<212> DNA 
<213> adenovirus 



<400> 26 

agatctgaat tcgagctcgc 
ccagtactct tggatcggaa 
tgagcgagtc cgcatcgacc 
agtcgcaagg taggctgagc 
tggcggaggt gctgctgatg 
aggtgaggtg tggcaggctt 



tgttgggctc gcggttgagg 
acccgtcggc ctccgaacgg 
ggatcggaaa acctctcgag 
accgtggcgg gcggcagcgg 
atgtaattaa agtaggcggt 
gagatct 



acaaactctt cgcggtcttt 60 
tactccgcca ccgagggacc 120 
aaaggcgtct aaccagtcac 18 0 
gtggcggtcg gggttgtttc 240 
cttgagacgg cggatggtcg 300 

327 



<210> 27 
<211> 32480 
<212> DNA 
<213> adenovirus 

<400> 27 

catcatcaat aatatacctt attttggatt gaagccaata tgataatgag ggggtggagt 60 
ttgtgacgtg gcgcggggcg tgggaacggg gcgggtgacg tagtagtgtg gcggaagtgt 120 
gatgttgcaa gtgtggcgga acacatgtaa gcgacggatg tggcaaaagt gacgtttttg 180 
gtgtgcgccg gtgtacacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag 240 
taaatttggg cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga 300 
agtgaaatct gaataatttt gtgttactca tagcgcgtaa tctctagcat cgatgtcgac 360 
aagcttgaat tcgattaatg tgagttagct cactcattag gcaccccagg ctttacactt 420 
tatgcttccg gctcgtatgt tgtgtggaat tgtgagcgga taacaatttc acacaggaaa 480 
cagctatgac catgattacg aattcggcgc agcaccatgg cctgaaataa cctctgaaag 540 
aggaacttgg ttaggtacct tctgaggcgg aaagaaccag ctgtggaatg tgtgtcagtt 600 
agggtgtgga aagtccccag gctccccagc aggcagaagt atgcaaagca tgcatctcaa 660 
ttagtcagca accaggtgtg gaaagtcccc aggctcccca gcaggcagaa gtatgcaaag 720 
catgcatctc aattagtcag caaccatagt cccgccccta actccgccca tcccgcccct 780 
aactccgccc agttccgccc attctccgcc ccatggctga ctaatttttt ttatttatgc 84 0 
agaggccgag gccgcctcgg cctctgagct attccagaag tagtgaggag gcttttttgg 900 
aggcctaggc ttttgcaaaa agcttgggat ctctataatc tcgcgcaacc tattttcccc 960 
tcgaacactt tttaagccgt agataaacag gctgggacac ttcacatgag cgaaaaatac 1020 
atcgtcacct gggacatgtt gcagatccat gcacgtaaac tcgcaagccg actgatgcct 1080 
tctgaacaat ggaaaggcat tattgccgta agccgtggcg gtctggtacc ggtgggtgaa 1140 
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gaccagaaac agcacctcga actgagccgc 
ggcgagatcg atcccgtcgt tttacaacgt 
cttaatcgcc ttgcagcaca tccccctttc 
accgatcgcc cttcccaaca gttgcgcagc 
ccggcaccag aagcggtgcc ggaaagctgg 
gtcgtcgtcc cctcaaactg gcagatgcac 
acctatccca ttacggtcaa tccgccgttt 
tcgctcacat ttaatgttga tgaaagctgg 
gatggcgtta actcggcgtt tcatctgtgg 
gacagtcgtt tgccgtctga atttgacctg 
ctcgcggtga tggtgctgcg ttggagtgac 
cggatgagcg gcattttccg tgacgtctcg 
gatttccatg ttgccactcg ctttaatgat 
gttcagatgt gcggcgagtt gcgtgactac 
gaaacgcagg tcgccagcgg caccgcgcct 
ggttatgccg atcgcgtcac actacgtctg 
gaaatcccga atctctatcg tgcggtggtt 
gaagcagaag cctgcgatgt cggtttccgc 
ctgaacggca agccgttgct gattcgaggc 
ggtcaggtca tggatgagca gacgatggtg 
tttaacgccg tgcgctgttc gcattatccg 
cgctacggcc tgtatgtggt ggatgaagcc 
aatcgtctga ccgatgatcc gcgctggcta 
gtgcagcgcg atcgtaatca cccgagtgtg 
cacggcgcta atcacgacgc gctgtatcgc 
gtgcagtatg aaggcggcgg agccgacacc 
gcgcgcgtgg atgaagacca gcccttcccg 
ctttcgctac ctggagagac gcgcccgctg 
aacagtcttg gcggtttcgc taaatactgg 
ggcggcttcg tctgggactg ggtggatcag 
ccgtggtcgg cttacggcgg tgattttggc 
aacggtctgg tctttgccga ccgcacgccg 
cagcagtttt tccagttccg tttatccggg 
ttccgtcata gcgataacga gctcctgcac 
gcaagcggtg aagtgcctct ggatgtcgct 
gaactaccgc agccggagag cgccgggcaa 
aacgcgaccg catggtcaga agccgggcac 
gaaaacctca gtgtgacgct ccccgccgcg 
gaaatggatt tttgcatcga gctgggtaat 
tttctttcac agatgtggat tggcgataaa 
ttcacccgtg caccgctgga taacgacatt 
aacgcctggg tcgaacgctg gaaggcggcg 
cagtgcacgg cagatacact tgctgatgcg 
catcagggga aaaccttatt tatcagccgg 
atggcgatta ccgttgatgt tgaagtggcg 
ctgaactgcc agctggcgca ggtagcagag 
gaaaactatc ccgaccgcct tactgccgcc 
gacatgtata ccccgtacgt cttcccgagc 
ttgaattatg gcccacacca gtggcgcggc 
caacagcaac tgatggaaac cagccatcgc 
- ctgaatatcg acggtttcca tatggggatt 
tcggcggaat tccagctgag cgccggtcgc 
taataataac cgggcaggcc atgtctgccc 
tatttaaaaa acacaaactt ttggatgttc 
atgggagcct acttcccgtt tttcccgatt 
agtgatacgg gtattatttt tgccgctatt 
tttggtctgc tttctgacaa actcggaact 
aaagcaatag catcacaaat ttcacaaata 
gtttgtccaa actcatcaat gtatcttatc 
agggtgggaa agaatatata aggtgggggt 
gccgccgccg ccatgagcac caactcgttt 
acgcgcatgc ccccatgggc cggggtgcgt 



gatattgccc agcgtttcaa cgcgctgtat 1200 
cgtgactggg aaaaccctgg cgttacccaa 1260 
gccagctggc gtaatagcga agaggcccgc 1320 
ctgaatggcg aatggcgctt tgcctggttt 1380 
ctggagtgcg atcttcctga ggccgatact 1440 
ggttacgatg cgcccatcta caccaacgta 1500 
gttcccacgg agaatccgac gggttgttac 1560 
ctacaggaag gccagacgcg aattattttt 1620 
tgcaacgggc gctgggtcgg ttacggccag 1680 
agcgcatttt tacgcgccgg agaaaaccgc 1740 
ggcagttatc tggaagatca ggatatgtgg 1800 
ttgctgcata aaccgactac acaaatcagc 18 60 
gatttcagcc gcgctgtact ggaggctgaa 1920 
ctacgggtaa cagtttcttt atggcagggt 1980 
ttcggcggtg aaattatcga tgagcgtggt 2040 
aacgtcgaaa acccgaaact gtggagcgcc 2100 
gaactgcaca ccgccgacgg cacgctgatt 2160 
gaggtgcgga ttgaaaatgg tctgctgctg 2220 
gttaaccgtc acgagcatca tcctctgcat 2280 
caggatatcc tgctgatgaa gcagaacaac 2340 
aaccatccgc tgtggtacac gctgtgcgac 2400 
aatattgaaa cccacggcat ^gtgccaatg 24 60 
ccggcgatga gcgaacgcgt aacgcgaatg 2520 
atcatctggt cgctggggaa tgaatcaggc 2580 
tggatcaaat ctgtcgatcc ttcccgcccg 2640 
acggccaccg atattatttg cccgatgtac 2700 
gctgtgccga aatggtccat caaaaaatgg 2760 
atcctttgcg aatacgccca cgcgatgggt 2820 
caggcgtttc gtcagtatcc ccgtttacag 2880 
tcgctgatta aatatgatga aaacggcaac 2940 
gatacgccga acgatcgcca gttctgtatg 3000 
catccagcgc tgacggaagc aaaacaccag 3060 
caaaccatcg aagtgaccag cgaatacctg 3120 
tggatggtgg cgctggatgg taagccgctg 3180 
ccacaaggta aacagttgat tgaactgcct 3240 
ctctggctca cagtacgcgt agtgcaaccg 3300 
atcagcgcct ggcagcagtg gcgtctggcg 3360 
tcccacgcca tcccgcatct gaccaccagc 3420 
aagcgttggc aatttaaccg ccagtcaggc 3480 
aaacaactgc tgacgccgct gcgcgatcag 354 0 
ggcgtaagtg aagcgacccg cattgaccct 3600 
ggccattacc aggccgaagc agcgttgttg 3660 
gtgctgatta cgaccgctca cgcgtggcag 3720 
aaaacctacc ggattgatgg tagtggtcaa 3780 
agcgatacac cgcatccggc gcggattggc 3840 
cgggtaaact ggctcggatt agggccgcaa 3900 
tgttttgacc gctgggatct gccattgtca 3960 
gaaaacggtc tgcgctgcgg gacgcgcgaa 4 020 
gacttccagt tcaacatcag ccgctacagt 4 080 
catctgctgc acgcggaaga aggcacatgg 4140 
ggtggcgacg actcctggag ccrcgtcagta 4200 
taccattacc agttggtctg gtgtcaaaaa 4260 
gtatttcgcg taaggaaatc cattatgtac 4320 
ggtttattct ttttctttta cttttttatc 4380 
tggctacatg acatcaacca tatcagcaaa 4440 
tctctgttct cgctattatt ccaaccgctg 4 500_ 
tgtttattgc agcttataat ggttacaaat 4560 
aagcattttt ttcactgcat tctagttgtg 4 620 
atgtctggat ccagatctgg gcgtggctta 4 680 
cttatgtagt tttgtatctg ttttgcagca 4740 
gatggaagca ttgtgagctc atatttgaca 4 800 
cagaatgtga tgggctccag cattgatggt 4 8 60 



I 

it 



cgccccgtcc tgcccgcaaa ctctactacc 
ttggagactg cagcctccgc cgccgcttca 
actgactttg ctttcctgag cccgcttgca 
gatgacaagt tgacggctct tttggcacaa 
gtttctcagc agctgttgga tctgcgccag 
cccaatgcgg tttaaaacat aaataaaaaa 
gtgtcttgct gtctttattt aggggttttg 
cggtcgttga gggtcctgtg tattttttcc 
agatacatgg gcataagccc gtctctgggg 
tgcggggtgg tgttgtagat gatccagtcg 
atgtctttca gtagcaagct gattgccagg 
cggttaagct gggatgggtg catacgtggg 
aggttggcta tgttcccagc catatccctc 
acagtgtatc cggtgcactt gggaaatttg 
aacttggaga cgcccttgtg acctccaaga 
atgggcccac gggcggcggc ctgggcgaag 
tgttccagga tgagatcgtc ataggccatt 
tgcggtataa tggttccatc cggcccaggg 
cacgctttga gttcagatgg ggggatcatg 
tccggggtag gggagatcag ctgggaagaa 
cagccggtgg gcccgtaaat cacacctatt 
cagctgccgt catccctgag caggggggcc 
ttttccctga ccaaatccgc cagaaggcgc 
gaagcaaagt ttttcaacgg tttgagaccg 
ccaagcagtt ccaggcggtc ccacagctcg 
atatctcctc gtttcgcggg ttggggcggc 
ccagacgggc cagggtcatg tctttccacg 
tcacggtgaa ggggtgcgct ccgggctgcg 
tgctggtgct gaagcgctgc cggtcttcgc 
tggtgtcata gtccagcccc tccgcggcgt 
aggcgccgca cgaggggcag tgcagacttt 
ccgattccgg ggagtaggca tccgcgccgc 
gccaggtgag ctctggccgt tcggggtcaa 
gtttcttacc tctggtttcc atgagccggt 
tgtccccgta tacagacttg agaggcctgt 
atagaaactc ggaccactct gagacaaagg 
agtgggaggg gtagcggtcg ttgtccacta 
acatgtcgcc ctcttcggca tcaaggaagg 
cgggtgttcc tgaagggggg ctataaaagg 
ccgcatcgct gtctgcgagg gccagctgtt 
tgacttctgc gctaagattg tcagtttcca 
ccgcggtgat gcctttgagg gtggccgcat 
tgtcaagctt ggtggcaaac gacccgtaga 
gcagggtttg gtttttgtcg cgatcggcgc 
attcgcgcgc aacgcaccgc cattcgggaa 
gcacgcgcca accgcggttg tgcagggtga 
gtaggcgctc gttggtccag cagaggcggc 
ggtctagctg cgtctcgtcc ggggggtctg 
gcgcgtcgaa gtagtctatc ttgcatcctt 
cggcaagcgc gcgctcgtat gggttgagtg 
cggaggcgta catgccgcaa atgtcgtaaa 
atgtagggta gcatcttcca ccgcggatgc 
agggagcgag gaggtcggga ccgaggttgc 
tctgcctgaa gatggcatgt gagttggatg 
tggcgtctgt gagacctacc gcgtcacgca 
tgaccagctc ggcggtgacc tgcacgtcta 
tgtcatactt atcctgtccc ttttttttcc 
ggtctttcca gtactcttgg atcggaaacc 
tgtagaactg gttgacggcc tggtaggcgc 
cctgcgcggc cttccggagc gaggtgtggg 
tgaggtactg gtatttgaag tcagtgtcgt 
ccgtgcgctt tttggaacgc ggatttggca 
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ttgacctacg agaccgtgtc tggaacgccg 4 920 
gccgctgcag ccaccgcccg cgggattgtg 4980 
agcagtgcag cttcccgttc atccgcccgc 5040 
ttggattctt tgacccggga acttaatgtc 5100 
caggtttctg ccctgaaggc ttcctcccct 5160 
ccagactctg tttggatttg gatcaagcaa 5220 
cgcgcgcggt aggcccggga ccagcggtct 5280 
aggacgtggt aaaggtgact ctggatgttc 534 0 
tggaggtagc accactgcag agcttcatgc 5400 
tagcaggagc gctgggcgtg gtgcctaaaa 54 60 
ggcaggccct tggtgtaagt gtttacaaag 5520 
gatatgagat gcatcttgga ctgtattttt 5580 
cggggattca tgttgtgcag aaccaccagc 5640 
tcatgtagct tagaaggaaa tgcgtggaag 5700 
ttttccatgc attcgtccat aatgatggca 5760 
atatttctgg gatcactaac gtcatagttg 5820 
tttacaaagc gcgggcggag ggtgccagac 5880 
gcgtagttac cctcacagat ttgcatttcc 5940 
tctacctgcg gggcgatgaa gaaaacggtt 6000 
agcaggttcc tgagcagctg cgacttaccg 6060 
accgggtgca actggtagtt aagagagctg 6120 
acttcgttaa gcatgtccct gactcgcatg 6180 
tcgccgccca gcgatagcag ttcttgcaag 624 0 
tccgccgtag gcatgctttt gagcgtttga 6300 
gtcacctgct ctacggcatc tcgatccagc 6360 
tttcgctgta cggcagtagt cggtgctcgt 6420 
ggcgcagggt cctcgtcagc gtagtctggg 6480 
cgctggccag ggtgcgcttg aggctggtcc 6540 
cctgcgcgtc ggccaggtag catttgacca 6600 
ggcccttggc gcgcagcttg cccttggagg 6660 
tgagggcgta gagcttgggc gcgagaaata 6720 
aggccccgca gacggtctcg cattccacga 6780 
aaaccaggtt tcccccatgc tttttgatgc 6840 
gtccacgctc ggtgacgaaa aggctgtccg 6900 
cctcgagcgg tgttccgcgg tcctcctcgt 6960 
ctcgcgtcca ggccagcacg aaggaggcta 7 020 
gggggtccac tcgctccagg gtgtgaagac 7080 
tgattggttt gtaggtgtag gccacgtgac 7140 
gggtgggggc gcgttcgtcc tcactctctt 7200 
ggggtgagta ctccctctga aaagcgggca 7260 
aaaacgagga ggatttgata ttcacctggc 7320 
ccatctggtc agaaaagaca atctttttgt 7380 
gggcgttgga cagcaacttg gcgatggagc 74 4 0 
gctccttggc cgcgatgttt agctgcacgt 7500 
agacggtggt gcgctcgtcg ggcaccaggt 7560 
caaggtcaac gctggtggct acctctccgc 7 620 
cgcccttgcg cgagcagaat ggcggtaggg 7 680 
cgtccacggt aaagaccccg ggcagcaggc 7740 
gcaagtctag cgcctgctgc catgcgcggg 7800 
ggggacccca tggcatgggg tgggtgagcg 7860 
cgtagagggg ctctctgagt attccaagat 7 920 
tggcgcgcac gtaatcgtat agttcgtgcg 7 980 
tacgggcggg ctgctctgct cggaagacta 8040 
atatggttgg acgctggaag acgttgaagc 8100 
cgaaggaggc gtaggagtcg cgcagcttgt 8160 
gggcgcagta gtccagggtt tccttgatga 8220 
acagctcgcg gttgaggaca aactcttcgc 8280 
cgtcggcctc cgaacggtaa gagcctagca 834 0 
agcatccctt ttctacgggt agcgcgtatg 8400 
tgagcgcaaa ggtgtccctg accatgactt 8460 
cgcatccgcc ctgctcccag agcaaaaagt 8520 
gggcgaaggt gacatcgttg aagagtatct 8580 
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ttcccgcgcg aggcataaag ttgcgtgtga 
tgttaattac ctgggcggcg agcacgatct 
tgtaaagttc caagaagcgc gggatgccct 
aggtgagctc ttcaggggag ctgagcccgt 
ggttggaagc gacgaatgag ctccacaggt 
gaaaggtcct aaactggcga cctatggcca 
gcgggtcttg ttcccagcgg tcccatccaa 
ctagaggctc atctccgccg aacttcatga 
aggcccccat ccaagtatag gtctctacat 
gatgcgagcc gatcgggaag aactggatct 
tgtggtgaaa gtagaagtcc ctgcgacggg 
gtgcgcagta ctggcagcgg tgcacgggct 
cgcgcacaag gaagcagagt gggaatttga 
cttctacttp ggctgcttgt ccttgaccgt 
ggaccaccac gccgcgcgag cccaaagtcc 
tgacaacatc gcgcagatgg gagctgtcca 
gcgggagctc ctgcaggttt acctcgcata 
gatacctaat ttccaggggc tggttggtgg 
cccgcggcgc gactacggta ccgcgcggcg 
atgcatctaa aagcggtgac gcgggcgagc 
cgggagaggg ggcaggggca cgtcggcgcc 
taggttgctg gcgaacgcga cgacgcggcg 
gaagacgacg ggcccggtga gcttgagcct 
gtcgttgacg gcggcctggc gcaaaatctc 
gatctcggcc atgaactgct cgatctcttc 
cacggtggcg gcgaggtcgt tggaaatgcg 
tccctcgttc cagacgcggc tgtagaccac 
cacctgcgcg agattgagct ccacgtgccg 
aaagaggtag ttgagggtgg tggcggtgtg 
tcgcaacgtg gattcgttga tatcccccaa 
gtccacggcg aagttgaaaa actgggagtt 
aagacggatg agctcggcga cagtgtcgcg 
ttcttcttct tcaatctcct cttccataag 
tgggggaggg gggacacggc ggcgacgacg 
gatcatctcc ccgcggcgac ggcgcatggt 
gcgcagttgg aagacgccgc ccgtcatgtc 
cggcagggat acggcgctaa cgatgcatct 
gagggacctg agcgagtccg catcgaccgg 
ccagtcacag tcgcaaggta ggctgagcac 
gttgtttctg gcggaggtgc tgctgatgat 
gatggtcgac agaagcacca tgtccttggg 
catgccccag gcttcgtttt gacatcggcg 
ttctaccggc acttcttctt ctccttcctc 
ggcggcggcg gagtttggcc gtaggtggcg 
gcccctcatc ggctgaagca gggctaggtc 
ctgcacctgc gtgagggtag actggaagtc 
cgtgttgatg gtgtaagtgc agttggccat 
ctgcgagagc tcggtgtacc tgagacgcga 
gcaagtccgc accaggtact ggtatcccac 
gggccagcgt agggtggccg gggctccggg 
tccgtagatg tacctggaca tccaggtgat 
gtcgcggacg cggttccaga tgttgcgcag 
ctggccggtc aggcgcgcgc aatcgttgac 
agcgggcact cttccgtggt ctggtggata 
gggttcgagc cccgtatccg gccgtccgcc 
aacccaggtg tgcgacgtca gacaacgggg 
ggcggctgct gcgctagctt ttttggccac 
gaaagcgaaa gcattaagtg gctcgctccc 
agtcgcggga cccccggttc gagtctcgga 
ctccccgtca tgcaagaccc cgcttgcaaa 
ttgcttttcc cagatgcatc cggtgctgcg 
agagcaagag cagcggcaga catgcagggc 



tgcggaaggg tcccggcacc tcggaacggt 864 0 
cgtcaaagcc gttgatgttg tggcccacaa 8700 
tgatggaagg caatttttta agttcctcgt 87 60 
gctctgaaag ggcccagtct gcaagatgag 8 820 
cacgggccat tagcatttgc aggtggtcgc 88 80 
ttttttctgg ggtgatgcag tagaaggtaa 8940 
ggttcgcggc taggtctcgc gcggcagtca 9000 
ccagcatgaa gggcacgagc tgcttcccaa 9060 
cgtaggtgac aaagagacgc tcggtgcgag 9120 
cccgccacca attggaggag tggctattga 9180 
ccgaacactc gtgctggctt ttgtaaaaac 9240 
gtacatcctg cacgaggttg acctgacgac 9300 
gcccctcgcc tggcgggttt ggctggtggt 9360 
ctggctgctc gaggggagtt acggtggatc 94 20 
agatgtccgc gcgcggcggt cggagcttga 94 80 
tggtctggag ctcccgcggc gtcaggtcag 954 0 
gacgggtcag ggcgcgggct agatccaggt 9600 
cggcgtcgat ggcttgcaag aggccgcatc 9660 
ggcggtgggc cgcgggggtg tccttggatg 9720 
ccccggaggt agggggggct ccggacccgc 97 8 0 
gcgcgcgggc aggagctggt gctgcgcgcg 98 40 
gttgatctcc tgaatctggc gcctctgcgt 9900 
gaaagagagt tcgacagaat caatttcggt 9960 
ctgcacgtct cctgagttgt cttgataggc 10020 
ctcctggaga tctccgcgtc cggctcgctc 10080 
ggccatgagc tgcgagaagg cgttgaggcc 10140 
gcccccttcg gcatcgcggg cgcgcatgac 10200 
ggcgaagacg gcgtagtttc gcaggcgctg 10260 
ttctgccacg aagaagtaca taacccagcg 10320 
ggcctcaagg cgctccatgg cctcgtagaa 10380 
gcgcgccgac acggttaact cctcctccag 104 4 0 
cacctcgcgc tcaaaggcta caggggcctc 10500 
ggcctcccct tcttcttctt ctggcggcgg 10560 
gcgcaccggg aggcggtcga caaagcgctc 10620 
ctcggtgacg gcgcggccgt tctcgcgggg 10 68 0 
ccggttatgg gttggcgggg ggctgccatg 10740 
caacaattgt tgtgtaggta ctccgccgcc 10800 
atcggaaaac ctctcgagaa aggcgtctaa 108 60 
cgtggcgggc ggcagcgggc ggcggtcggg 10 920 
gtaattaaag taggcggtct tgagacggcg 10980 
tccggcctgc tgaatgcgca ggcggtcggc 11040 
caggtctttg tagtagtctt gcatgagcct 11100 
ttgtcctgca tctcttgcat ctatcgctgc 11160 
ccctcttcct cccatgcgtg tgaccccgaa 11220 
ggcgacaacg cgctcggcta atatggcctg 11280 
atccatgtcc acaaagcggt ggtatgcgcc 11340 
aacggaccag ttaacggtct ggtgacccgg 11400 
gtaagccctc gagtcaaata cgtagtcgtt 114 60 
caaaaagtgc ggcggcggct ggcggtagag 11520 
ggcgagatct tccaacataa ggcgatgata 11580 
gccggcggcg gtggtggagg cgcgcggaaa 11640 
cggcaaaaag tgctccatgg tcgggacgct 11700 
gctctagacc gtgcaaaagg agagcctgta 117 60 
aattcgcaag ggtatcatgg cggacgaccg 11820 
gtgatccatg cggttaccgc ccgcgtgtcg 11880 
gagtgctcct tttggcttcc ttccaggcgc 1194.0 
tggccgcgcg cagcgtaagc ggttaggctg 12000 
tgtagccgga gggttatttt ccaagggttg 12060 
ccggccggac tgcggcgaac gggggtttgc 12120 
ttcctccgga aacagggacg agcccctttt 12180 
gcagatgcgc ccccctcctc agcagcggca 1224 0 
accctcccct cctcctaccg cgtcaggagg 12300 
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ggcgacatcc gcggttgacg cggcagcaga 
ccggcactac ctggacttgg aggagggcga 
tgagcggtac ccaagggtgc agctgaagcg 
gaacctgttt cgcgaccgcg agggagagga 
cgcagggcgc gagctgcggc atggcctgaa 
tgagcccgac gcgcgaaccg ggattagtcc 
ggtaaccgca tacgagcaga cggtgaacca 
ccacgtgcgt acgcttgtgg cgcgcgagga 
ctttgtaagc gcgctggagc aaaacccaaa 
tatagtgcag cacagcaggg acaacgaggc 
gcccgagggc cgctggctgc tcgatttgat 
gcgcagcttg agcctggctg acaaggtggc 
caagttttac gcccgcaaga tataccatac 
gatcgagggg ttctacatgc gcatggcgct 
cgtttatcgc aacgagcgca tccacaaggc 
cgaccgcgag ctgatgcaca gcctgcaaag 
agaggccgag tcctactttg acgcgggcgc 
cctggaggca gctggggccg gacctgggct 
cggcggcgtg gaggaatatg acgaggacga 
agcggtgatg tttctgatca gatgatgcaa 
ctgcagagcc agccgtccgg ccttaactcc 
atcatgtcgc tgactgcgcg caatcctgac 
ctctccgcaa ttctggaagc ggtggtcccg 
ctggcgatcg taaacgcgct ggccgaaaac 
gtctacgacg cgctgcttca gcgcgtggct 
ctggaccggc tggtggggga tgtgcgcgag 
cagggcaacc tgggctccat ggttgcacta 
gtgccgcggg gacaggagga ctacaccaac 
gagacaccgc aaagtgaggt gtaccagtct 
caaggcctgc agaccgtaaa cctgagccag 
gtgcgggctc ccacaggcga ccgcgcgacc 
ctgttgctgc tgctaatagc gcccttcacg 
ctaggtcact tgctgacact gtaccgcgag 
actttccagg agattacaag tgtcagccgc 
gaggcaaccc taaactacct gctgaccaac 
ttaaacagcg aggaggagcg cattttgcgc 
atgcgcgacg gggtaacgcc cagcgtggcg 
ggcatgtatg cctcaaaccg gccgtttatc 
gccgccgtga accccgagta tttcaccaat 
cctggtttct acaccggggg attcgaggtg 
gacatagacg acagcgtgtt ttccccgcaa 
gagcaggcag aggcggcgct gcgaaaggaa 
ctaggcgctg cggccccgcg gtcagatgct 
cttaccagca ctcgcaccac ccgcccgcgc 
tcgctgctgc agccgcagcg cgaaaaaaac 
gagagcctag tggacaagat gagtagatgg 
ccaggcccgc gcccgcccac ccgtcgtcaa 
gaggacgatg actcggcaga cgacagcagc 
tttgcgcacc ttcgccccag gctggggaga 
aataaaaaac tcaccaaggc catggcaccg 
..t-gcggcgcgc ggcgatgtat gaggaaggtc 
cggcgccagt ggcggcggcg ctgggttctc 
tgcctccgcg gtacctgcgg cctaccgggg 
cacccctatt cgacaccacc cgtgtgtacc 
ccctgaacta ccagaacgac cacagcaact 
acagcccggg ggaggcaagc acacagacca 
gcgacctgaa aaccatcctg cataccaaca 
ataagtttaa ggcgcgggtg atggtgtcgc 
tgaaatacga gtgggtggag ttcacgctgc 
tagaccttat gaacaacgcg atcgtggagc 
ttctggaaag cgacatcggg gtaaagtttg 
ccgtcactgg tcttgtcatg cctggggtat 



tggtgattac gaacccccgc ggcgccgggc 12360 
gggcctggcg cggctaggag cgccctctcc 12420 
tgatacgcgt gaggcgtacg tgccgcggca 124 80 
gcccgaggag atgcgggatc gaaagttcca 12540 
tcgcgagcgg ttgctgcgcg aggaggactt 12600 
cgcgcgcgca cacgtggcgg ccgccgacct 12 6 60 
ggagattaac tttcaaaaaa gctttaacaa 12720 
ggtggctata ggactgatgc atctgtggga 12780 
tagcaagccg ctcatggcgc agctgttcct 128 4 0 
attcagggat gcgctgctaa acatagtaga 12900 
aaacatcctg cagagcatag tggtgcagga 12 9 60 
cgccatcaac tattccatgc ttagcctggg 13020 
cccttacgtt cccatagaca aggaggtaaa 13080 
gaaggtgctt accttgagcg acgacctggg 13140 
cgtgagcgtg agccggcggc gcgagctcag 13200 
ggccctggct ggcacgggca gcggcgatag 132 60 
tgacctgcgc tgggccccaa gccgacgcgc 13320 
ggcggtggca cccgcgcgcg ctggcaacgt 1338 0 
tgagtacgag ccagaggacg gcgagtacta 134 40 
gacgcaacgg acccggcggt gcgggcggcg 13500 
acggacgact ggcgccaggt catggaccgc 135 60 
gcgttccggc agcagccgca ^gccaaccgg 13620 
gcgcgcgcaa accccacgca cgagaaggtg 13680 
agggccatcc ggcccgacga ggccggcctg 13740 
cgttacaaca gcggcaacgt gcagaccaac 13800 
gccgtggcgc agcgtgagcg cgcgcagcag 138 60 
aacgccttcc tgagtacaca gcccgccaac 13920 
tttgtgagcg cactgcggct aatggtgact 13980 
gggccagact attttttcca gaccagtaga 14040 
gctttcaaaa acttgcaggg gctgtggggg 14100 
gtgtctagct tgctgacgcc caactcgcgc 14160 
gacagtggca gcgtgtcccg ggacacatac 14220 
gccataggtc aggcgcatgt ggacgagcat 14280 
gcgctggggc aggaggacac gggcagcctg 1434 0 
cggcggcaga agatcccctc gttgcacagt 14 4 00 
tacgtgcagc agagcgtgag ccttaacctg 14 4 60 
ctggacatga ccgcgcgcaa catggaaccg 14 520 
aaccgcctaa tggactactt gcatcgcgcg 14580 
gccatcttga acccgcactg gctaccgccc 14 64 0 
cccgagggta acgatggatt cctctgggac 14700 
ccgcagaccc tgctagagtt gcaacagcgc 147 60 
agcttccgca ggccaagcag cttgtccgat 14820 
agtagcccat ttccaagctt gatagggtct 14880 
ctgctgggcg aggaggagta cctaaacaac 14 94 0 
ctgcctccgg catttcccaa caacgggata 15000 
aagacgtacg cgcaggagca cagggacgtg 150 60 
aggcacgacc gtcagcgggg tctggtgtgg 15120 
gtcctggatt tgggagggag tggcaacccg 15180 
atgttttaaa aaaaaaaaag catgatgcaa 15240 
agcgttggtt ttcttgtatt ccccttagta 15300 
ctcctccctc ctacgagagt gtggtgagcg 15360 
ccttcgatgc tcccctggac ccgccgtttg 154 20 
ggagaaacag catccgttac tctgagttgg 154 8 0 
tggtggacaa caagtcaacg gatgtggcat 1554 0 
ttctgaccac ggtcattcaa aacaatgact 15600 
tcaatcttga cgaccggtcg cactggggcg 15660 
tgccaaatgt gaacgagttc atgtttacca 15720 
gcttgcctac taaggacaat caggtggagc 15780 
ccgagggcaa ctactccgag accatgacca 158 4 0 
actacttgaa agtgggcaga cagaacgggg 15900 
acacccgcaa cttcagactg gggtttgacc 15960 
atacaaacga agccttccat ccagacatca 16020 
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ttttgctgcc aggatgcggg gtggacttca 
tccgcaagcg gcaacccttc caggagggct 
gtaacattcc cgcactgttg gatgtggacg 
aacagggcgg gggtggcgca ggcggcagca 
ccaacgcggc agccgcggca atgcagccgg 
gcgacacctt tgccacacgg gctgaggaga 
ctgccgcccc cgctgcgcaa cccgaggtcg 
ccctgacaga ggacagcaag aaacgcagtt 
cccagtaccg cagctggtac cttgcataca 
catggaccct gctttgcact cctgacgtaa 
tgccagacat gatgcaagac cccgtgacct 
cggtggtggg cgccgagctg ttgcccgtgc 
tctactccca actcatccgc cagtttacct 
agaaccagat tttggcgcgc ccgccagccc 
ctgctctcac agatcacggg acgctaccgc 
tgaccattac tgacgccaga cgccgcacct 
tctcgccgcg cgtcctatcg agccgcactt 
ccagcaataa cacaggctgg ggcctgcgct 
agcgctccga ccaacaccca gtgcgcgtgc 
acaaacgcgg ccgcactggg cgcaccaccg 
aggcgcgcaa ctacacgccc acgccgccac 
ccgtggtgcg cggagcccgg cgctatgcta 
gtcgccaccg ccgccgaccc ggcactgccg 
gcgcacgtcg caccggccga cgggcggcca 
ttgtcactgt gccccccagg tccaggcgac 
gtgctatgac tcagggtcgc aggggcaacg 
tgcgcgtgcc cgtgcgcacc cgccccccgc 
actcgtactg ttgtatgtat ccagcggcgg 
aaatcaaaga agagatgctc caggtcatcg 
aagagcagga ttacaagccc cgaaagctaa 
atgatgaact tgacgacgag gtggaactgc 
agtggaaagg tcgacgcgta aaacgtgttt 
ccggtgagcg ctccacccgc acctacaagc 
acctgcttga gcaggccaac gagcgcctcg 
acatgctggc gttgccgctg gacgagggca 
tgcagcaggt gctgcccgcg cttgcaccgt 
ctggtgactt ggcacccacc gtgcagctga 
tcttggaaaa aatgaccgtg gaacctgggc 
agcaggtggc gccgggactg ggcgtgcaga 
gcaccagtat tgccaccgcc acagagggca 
cggtggcgga tgccgcggtg caggcggtcg 
tgcaaacgga cccgtggatg tttcgcgttt 
agtacggcgc cgccagcgcg ctactgcccg 
cccccggcta tcgtggctac acctaccgcc 
ccaccactgg aacccgccgc cgccgtcgcc 
tgcgcagggt ggctcgcgaa ggaggcagga 
ccagcatcgt ttaaaagccg gtctttgtgg 
tccgtttccc ggtgccggga ttccgaggaa 
acggcctgac gggcggcatg cgtcgtgcgc 
gcatgcgcgg cggtatcctg cccctcctta 
tgcccggaat tgcatccgtg gccttgcagg 
tgtggaaaaa tcaaaataaa aagtctggac 
tgtagaatgg aagacatcaa ctttgcgtct 
atgggaaact ggcaagatat cggcaccagc 
tcgctgtgga gcggcattaa aaatttcggt 
tggaacagca gcacaggcca gatgctgagg 
aaggtggtag atggcctggc ctctggcatt 
gtgcaaaata agattaacag taagcttgat 
gccgtggaga cagtgtctcc agaggggcgt 
gaaactctgg tgacgcaaat agacgagcct 
ctgcccacca cccgtcccat cgcgcccatg 
gtaacgctgg acctgcctcc ccccgccgac 



cccacagccg cctgagcaac ttgttgggca 16080 
ttaggatcac ctacgatgat ctggagggtg 16140 
cctaccaggc gagcttgaaa gatgacaccg 16200 
acagcagtgg cagcggcgcg gaagagaact 162 60 
tggaggacat gaacgatcat gccattcgcg 16320 
agcgcgctga ggccgaagca gcggccgaag 16380 
agaagcctca gaagaaaccg gtgatcaaac 16440 
acaacctaat aagcaatgac agcaccttca 16500 
actacggcga ccctcagacc ggaatccgct 16560 
cctgcggctc ggagcaggtc tactggtcgt 16620 
tccgctccac gcgccagatc agcaactttc 16680 
actccaagag cttctacaac gaccaggccg 16740 
ctctgaccca cgtgttcaat cgctttcccg 16800 
ccaccatcac caccgtcagt gaaaacgttc 168 60 
tgcgcaacag catcggagga gtccagcgag 16920 
gcccctacgt ttacaaggcc ctgggcatag 16980 
tttgagcaag catgtccatc cttatatcgc 17040 
tcccaagcaa gatgtttggc ggggccaaga 17100 
gcgggcacta ccgcgcgccc tggggcgcgc 17160 
tcgatgacgc catcgacgcg gtggtggagg 17220 
cagtgtccac agtggacgcg gccattcaga 17280 
aaatgaagag acggcggagg egcgtagcac 17340 
cccaacgcgc ggcggcggcc ctgcttaacc 17 4 00 
tgcgggccgc tcgaaggctg gccgcgggta 17 4 60 
gagcggccgc cgcagcagcc gcggccatta 17520 
tgtattgggt gcgcgactcg gttagcggcc 17580 
gcaactagat tgcaagaaaa aactacttag 17 640 
cggcgcgcaa cgaagctatg tccaagcgca 17700 
cgccggagat ctatggcccc ccgaagaagg 177 60 
agcgggtcaa aaagaaaaag aaagatgatg 17820 
tgcacgctac cgcgcccagg cgacgggtac 17880 
tgcgacccgg caccaccgta gtctttacgc 17940 
gcgtgtatga tgaggtgtac ggcgacgagg 18000 
gggagtttgc ctacggaaag cggcataagg 18060 
acccaacacc tagcctaaag cccgtaacac 18120 
ccgaagaaaa gcgcggccta aagcgcgagt 18180 
tggtacccaa gcgccagcga ctggaagatg 18240 
tggagcccga ggtccgcgtg cggccaatca 18300 
ccgtggacgt tcagataccc actaccagta 18360 
tggagacaca aacgtccccg gttgcctcag 18420 
ctgcggccgc gtccaagacc tctacggagg 18480 
cagccccccg gcgcccgcgc ggttcgagga 18540 
aatatgccct acatccttcc attgcgccta 18 600 
ccagaagacg agcaactacc cgacgccgaa 18 660 
gtcgccagcc cgtgctggcc ccgatttccg 18720 
ccctggtgct gccaacagcg cgctaccacc 18780 
ttcttgcaga tatggccctc acctgccgcc 18840 
gaatgcaccg taggaggggc atggccggcc 18 900 
accaccggcg gcggcgcgcg tcgcaccgtc 18960 
ttccactgat cgccgcggcg attggcgccg 19020 
cgc-agagaca ctgattaaaa acaagttgca 19080 
tctcacgctc gcttggtcct gtaactattt 19140 
ctggccccgc gacacggctc gcgcccgttc 19200 
aatatgagcg gtggcgcctt cagctggggc 19260 
tccaccgtta agaactatgg cagcaaggcc 19320 
gataagttga aagagcaaaa tttccaacaa 19380 
agcggggtgg tggacctggc caaccaggca 19440 
ccccgccctc ccgtagagga gcctccaccg 19500 
ggcgaaaagc gtccgcgccc cgacagggaa 19560 
ccctcgtacg aggaggcact aaagcaaggc 19620 
gctaccggag tgctgggcca gcacacaccc 19680 
acccagcaga aacctgtgct gccaggcccg 19740 
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accgccgttg 
cgatcgttgc 
ctgggggtgc 
tgtcatgtat 
ccaagatggc 
acgcctcgga 
tcagcctgaa 
accggtccca 
cgtacaaggc 
cgtactttga 
ctgcctacaa 
ctactgctct 
agcaagctga 
ttacaaagga 
catttcaacc 
cagctgggag 
aacccacaaa 
aaagtcaagt 
acttgactcc 
atatttctta 
ctatgcccaa 
acaacagcac 
tagatttgca 
atagaaccag 
gaattattga 
gtgtgattaa 
gggaaaaaga 
ccatggaaat 
tgtatttgcc 
acacctacga 
accttggagc 
gcaatgctgg 
acatccaggt 
acacctacga 
atgacctaag 
tcttccccat 
acgaccagtc 
acgctaccaa 
ccttcacgcg 
acacctactc 
agaaggtggc 
cccccaacga 
gtaacatgac 
agggcttcta 
agcccatgag 
tcctacacca 
gacaggccta 
ttacccagaa 
ttatgtccat 
acgcgctaga 
tgtttgaagt 
tgtacctgcg 
aacaacagct 
tggttgtggg 
acacaagctc 
gatggccttt 
ttctgaccag 
cgccattgct 
ggggcccaac 
ctggccccaa 
ctccatgctc 
cagcttcctg 



ttgtaacccg 
ggcccgtagc 
aatccctgaa 
gcgtccatgt 
taccccttcg 
gtacctgagc 
taacaagttt 
gcgtttgacg 
gcggttcacc 
catccgcggc 
cgccctggct 
tgaaataaac 
gcagcaaaaa 
gggtattcaa 
tgaacctcaa 
agtccttaaa 
tgaaaatgga 
ggaaatgcaa 
taaagtggta 
catgcccact 
caggcctaat 
gggtaatatg 
agacagaaac 
gtacttttct 
aaatcatgga 
tacagagact 
tgctacagaa 
caatctaaat 
cgacaagcta 
ctacatgaac 
acgctggtcc 
cctgcgctac 
gcctcagaag 
gtggaacttc 
ggttgacgga 
ggcccacaac 
ctttaacgac 
cgtgcccata 
ccttaagact 
tggctctata 
cattaccttt 
gtttgaaatt 
caaagactgg 
tatcccagag 
ccgtcaggtg 
acacaacaac 
ccctgctaac 
aaagtttctt 
gggcgcactc 
catgactttt 
ctttgacgtg 
cacgcccttc 
gccgccatgg 
ccatattttt 
gcctgcgcca 
gcctggaacc 
cgactcaagc 
tcttcccccg 
tcggccgcct 
actcccatgg 
aacagtcccc 
gagcgccact 



tcctagccgc 
cagtggcaac 
gcgccgacga 
cgccgccaga 
atgatgccgc 
cccgggctgg 
agaaacccca 
ctgcggttca 
ctagctgtgg 
gtgctggaca 
cccaagggtg 
ctagaagaag 
actcacgtat 
ataggtgtcg 
ataggagaat 
aagactaccc 
gggcaaggca 
tttttctcaa 
ttgtacagtg 
attaaggaag 
tacattgctt 
ggtgttctgg 
acagagcttt 
atgtggaatc 
actgaagatg 
cttaccaagg 
ttttcagata 
gccaacctgt 
aagtacagtc 
aagcgagtgg 
cttgactata 
cgctcaatgt 
ttctttgcca 
aggaaggatg 
gccagcatta 
accgcctcca 
tatctctccg 
tccatcccct 
aaggaaaccc 
ccctacctag 
gactcttctg 
aagcgctcag 
ttcctggtac 
agctacaagg 
gtggatgata 
tctggatttg 
ttcccctatc 
tgcgatcgca 
acagacctgg 
gaggtggatc 
gtccgtgtgc 
tcggccggca 
gctccagtga 
tgggcaccta 
tagtcaatac 
cgcactcaaa 
aggtttacca 
accgctgtat 
gtggactatt 
atcacaaccc 
aggtacagcc 
cgccctactt 



gcgtccctgc 
tggcaaagca 
tgcttctgaa 
ggagctgctg 
agtggtctta 
tgcagtttgc 
cggtggcgcc 
tccctgtgga 
gtgataaccg 
ggggccctac 
ccccaaatcc 
aggacgatga 
ttgggcaggc 
aaggtcaaac 
ctcagtggta 
caatgaaacc 
ttcttgtaaa 
ctactgaggc 
aagatgtaga 
gtaactcacg 
ttagggacaa 
cgggccaagc 
cataccagct 
aggctgttga 
aacttccaaa 
taaaacctaa 
aaaatgaaat 
ggagaaattt 
cttccaacgt 
tggctcccgg 
tggacaacgt 
tgctgggcaa 
ttaaaaacct 
ttaacatggt 
agtttgatag 
cgcttgaggc 
ccgccaacat 
cccgcaactg 
catcactggg 
atggaacctt 
tcagctggcc 
ttgacgggga 
aaatgctagc 
accgcatgta 
ctaaatacaa 
ttggctacct 
cgcttatagg 
ccctttggcg 
gccaaaacct 
ccatggacga 
accggccgca 
acgccacaac 
gcaggaactg 
tgacaagcgc 
ggccggtcgc 
aacatgctac 
gtttgagtac 
aacgctggaa 
ctgctgcatg 
caccatgaac 
caccctgcgt 
ccgcagccac 



gccgcgccgc 

cactgaacag 

tagctaacgt 

agccgccgcg 

catgcacatc 

ccgcgccacc 

tacgcacgac 

ccgtgaggat 

tgtgctggac 

ttttaagccc 

ttgcgaatgg 

caacgaagac 

gccttattct 

acctaaatat 

cgaaactgaa 

atgttacggt 

gcaacaaaat 

gaccgcaggc 

tatagaaacc 

agaactaatg 

ttttattggt 

atcgcagttg 

tttgcttgat 

cagctatgat 

ttactgcttt 

aacaggtcag 

aagagttgga 

cctgtactcc 

aaaaatttct 

gttagtggac 

caacccattt 

tggtcgctat 

ccttctcctg 

tctgcagagc 

catttgcctt 

catgcttaga 

gctctaccct 

ggcggctttc 

ctcgggctac 

ttacctcaac 

tggcaatgac 

gggttacaac 

taactacaac 

ctccttcttt 

ggactaccaa 

tgcccccacc 

caagaccgca 

catcccattc 

tctctacgcc 

gcccaccctt 

ccgcggcgtc 

ataaagaagc 

aaagccattg 

tttccaggct 

gagactgggg 

ctctttgagc 

gagtcactcc 

aagtccaccc 

tttctccacg 

cttattaccg 

cgcaaccagg 

agtgcgcaga 



cagcggtccg 

catcgtgggt 

gtcgtatgtg 

cgcccgcttt 

tcgggccagg 

gagacgtact 

gtgaccacag 

actgcgtact 

atggcttcca 

tactctggca 

gatgaagctg 

gaagtagacg 

ggtataaata 

gccgataaaa 

attaatcatg 

tcatatgcaa 

ggaaagctag 

aatggtgata 

ccagacactc 

ggccaacaat 

ctaatgtatt 

aatgctgttg 

tccattggtg 

ccagatgtta 

ccactgggag 

gaaaatggat 

aataattttg 

aacatagcgc 

gataacccaa 

tgctacatta 

aaccaccacc 

gtgcccttcc 

ccgggctcat 

tccctaggaa 

tacgccacct 

aacgacacca 

atacccgcca 

cgcggctggg 

gacccttatt 

cacaccttta 

cgcctgctta 

gttgcccagt 

attggctacc 

agaaacttcc 

caggtgggca 

atgcgcgaag 

gttgacagca 

tccagtaact 

aactccgccc 

ctttatgttt 

atcgaaaccg 

aagcaacatc 

tcaaagatct 

ttgtttctcc 

gcgtacactg 

cctttggctt 

tgcgccgtag 

aaagcgtaca 

cctttgccaa 

gggtacccaa 

aacagctcta 

ttaggagcgc 



19800 

19860 

19920 

19980 

20040 

20100 

20160 

20220 

20280 

20340 

20400 

20460 

20520 

20580 

20640 

20700 

20760 

20820 

20880 

20940 

21000 

21060 

21120 

21180 

21240 

21300 

21360 

21420 

21480 

21540 

21600 

21660 

21720 

21780 

21840 

21900 

21960 

22020 

22080 

22140 

22200 

22260 

22320 

22380 

22440 

22500 

22560 

22620 

22680 

22740 

22800 

22860 

22920 

22980 

23040 

23100 

2316~0 

23220 

23280 

23340 

23400 

23460 
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cacttctttt tgtcacttga aaaacatgta 
aggcaaatgc ttttatttgt acactctcgg 
cgccgtttaa aaatcaaagg ggttctgccg 
gttgcgatac tggtgtttag tgctccactt 
ggtgaagttt tcactccaca ggctgcgcac 
cgatatcttg aagtcgcagt tggggcctcc 
gttgcagcac tggaacacta tcagcgccgg 
ggagatcaga tccgcgtcca ggtcctccgc 
tagctgcctt cccaaaaagg gcgcgtgccc 
catcaaaagg tgaccgtgcc cggtctgggc 
gatctgctta aaagccacct gagcctttgc 
gccggaaaac tgattggccg gacaggccgc 
ggagatctgc accacatttc ggccccaccg 
ctccttcagc gcgcgctgcc cgttttcgct 
atttatcata atgcttccgt gtagacactt 
cagccacaac gcgcagcccg tgggctcgtg 
caggtacgcc tgcaggaatc gccccatcat 
cagctgcaac ccgcggtgct cctcgttcag 
cacttggtca ggcagtagtt tgaagttcgc 
catcagcgcg cgcgcagcct ccatgccctt 
cgggttcatc accgtaattt cactttccgc 
ccgcatacca cgcgccactg ggtcgtcttc 
tttgccatgc ttgattagca ccggtgggtt 
ttctctttct tcctcgctgt ccacgattac 
agaagggcgc ttctttttct tcttgggcgc 
ccgcgggctg ggtgtgcgcg gcaccagcgc 
ctcgatacgc cgcctcatcc gcttttttgg 
cggggacgac acgtcctcca tggttggggg 
ggtggtttcg cgctgctcct cttcccgact 
gatcatggag tcagtcgaga agaaggacag 
cgcctccacc gatgccgcca acgcgcctac 
ggaggaggaa gtgattatcg agcaggaccc 
ctcagtacca acagaggata aaaagcaaga 
agtcgggcgg ggggacgaaa ggcatggcga 
gaagcatctg cagcgccagt gcgccattat 
gcccctcgcc atagcggatg tcagccttgc 
accccccaaa cgccaagaaa acggcacatg 
cgtatttgcc gtgccagagg tgcttgccac 
acccctatcc tgccgtgcca accgcagccg 
cgctgtcata cctgatatcg cctcgctcaa 
acgcgacgag aagcgcgcgg caaacgctct 
ctctggagtg ttggtggaac tcgagggtga 
catcgaggtc acccactttg cctacccggc 
agtcatgagt gagctgatcg tgcgccgtgc 
agaacaaaca gaggagggcc tacccgcagt 
aacgcgcgag cctgccgact tggaggagcg 
taccgtggag cttgagtgca tgcagcggtt 
agaggaaaca ttgcactaca cctttcgaca 
caacgtggag ctctgcaacc tggtctccta 
gcaaaacgtg cttcattcca cgctcaaggg 
cgtttactta tttctatgct acacctggca 
ggaggagtgc aacctcaagg agctgcagaa 
gacggccttc aacgagcgct ccgtggccgc 
cctgcttaaa accctgcaac agggtctgcc 
ctttaggaac tttatcctag agcgctcagg 
tagcgacttt gtgcccatta agtaccgcga 
ccttctgcag ctagccaact accttgccta 
tgacggtcta ctggagtgtc actgtcgctg 
ttgcaattcg cagctgctta acgaaagtca 
ctcgcctgac gaaaagtccg cggctccggg 
ggcttacctt cgcaaatttg tacctgagga 
agaccaatcc cgcccgccaa atgcggagct 



aaaataatgt actagagaca ctttcaataa 23520 
gtgattattt acccccaccc ttgccgtctg 23580 
cgcatcgcta tgcgccactg gcagggacac 23640 
aaactcaggc acaaccatcc gcggcagctc 23700 
catcaccaac gcgtttagca ggtcgggcgc 23760 
gccctgcgcg cgcgagttgc gatacacagg 23820 
gtggtgcacg ctggccagca cgctcttgtc 23880 
gttgctcagg gcgaacggag tcaactttgg 23940 
aggctttgag ttgcactcgc accgtagtgg 24000 
gttaggatac agcgcctgca taaaagcctt 24060 
gccttcagag aagaacatgc cgcaagactt 24120 
gtcgtgcacg cagcaccttg cgtcggtgtt 24180 
gttcttcacg atcttggcct tgctagactg 24240 
cgtcacatcc atttcaatca cgtgctcctt 24300 
aagctcgcct tcgatctcag cgcagcggtg 24360 
atgcttgtag gtcacctctg caaacgactg 24420 
cgtcacaaag gtcttgttgc tggtgaaggt 24480 
ccaggtcttg catacggccg ccagagcttc 24540 
ctttagatcg ttatccacgt ggtacttgtc 24 600 
ctcccacgca gacacgatcg gcacactcag 24 660 
ttcgctgggc tcttcctctt cctcttgcgt 24720 
attcagccgc cgcactgtgc gcttacctcc 24780 
gctgaaaccc accatttgta gcgccacatc 24840 
ctctggtgat ggcgggcgct cgggcttggg 24 900 
aatggccaaa tccgccgccg aggtcgatgg 24 960 
gtcttgtgat gagtcttcct cgtcctcgga 25020 
gggcgcccgg ggaggcggcg gcgacgggga 25080 
acgtcgcgcc gcaccgcgtc cgcgctcggg 25140 
ggccatttcc ttctcctata ggcagaaaaa 25200 
cctaaccgcc ccctctgagt tcgccaccac 25260 
caccttcccc gtcgaggcac ccccgcttga 25320 
aggttttgta agcgaagacg acgaggaccg 25380 
ccaggacaac gcagaggcaa acgaggaaca 25440 
ctacctagat gtgggagacg acgtgctgtt 25500 
ctgcgacgcg ttgcaagagc gcagcgatgt 25560 
ctacgaacgc cacctattct caccgcgcgt 25 620 
cgagcccaac ccgcgcctca acttctaccc 25680 
ctatcacatc tttttccaaa actgcaagat 25740 
agcggacaag cagctggcct tgcggcaggg 25800 
cgaagtgcca aaaatctttg agggtcttgg 258 60 
gcaacaggaa aacagcgaaa atgaaagtca 25920 
caacgcgcgc ctagccgtac taaaacgcag 25980 
acttaaccta ccccccaagg tcatgagcac 26040 
gcagcccctg gagagggatg caaatttgca 2 6100 
tggcgacgag cagctagcgc gctggcttca 26160 
acgcaaacta atgatggccg cagtgctcgt 26220 
ctttgctgac ccggagatgc agcgcaagct 26280 
gggctacgta cgccaggcct gcaagatctc 26340 
ccttggaatt ttgcacgaaa accgccttgg 2 64 00 
cgaggcgcgc cgcgactacg tccgcgactg 2 64 60 
gacggccatg ggcgtttggc agcagtgctt 26520 
actgctaaag caaaacttga aggacctatg 26580 
gcacctggcg gacatcattt tccccgaacg 2 664 0 
agacttcacc agtcaaagca tgttgcagaa 26700 
aatcttgccc gccacctgct gtgcacttcc 267 60 
atgccctccg ccgctttggg gccactgcta 26820 
ccactctgac ataatggaag acgtgagcgg 26880 
caacctatgc accccgcacc gctccctggt 26940 
aattatcggt acctttgagc tgcagggtcc 27000 
gttgaaactc actccggggc tgtggacgtc 27 0 60 
ctaccacgcc cacgagatta ggttctacga 27120 
taccgcctgc gtcattaccc agggccacat 27180 



tcttggccaa ttgcaagcca tcaacaaagc 
gggggtttac ttggaccccc agtccggcga 
gccctatcag cagcagccgc gggcccttgc 
agctgccgcc gccacccacg gacgaggagg 
tggacgagga ggaggaggac atgatggaag 
aggtcgaaga ggtgtcagac gaaacaccgt 
cccagaaatc ggcaaccggt tccagcatgg 
cactgcccgt tcgccgaccc aaccgtagat 
ccaagcagcc gccgccgtta gcccaagagc 
gcgggcacaa gaacgccata gttgcttgct 
cccgccgctt tcttctctac catcacggcg 
accgtcatct ctacagccca tactgcaccg 
gccacacaga agcaaaggcg accggatagc 
gcggcggcag cagcaggagg aggagcgctg 
cgcgagctta gaaacaggat ttttcccact 
caagaacaag agctgaaaat aaaaaacagg 
tatcacaaaa gcgaagatca gcttcggcgc 
aaatactgcg cgctgactct taaggactag 
aaactacgtc atctccagcg gccacacccg 
agcaaggaaa ttcccacgcc ctacatgtgg 
ggagctgccc aagactactc aacccgaata 
tcccgggtca acggaatccg cgcccaccga 
accaccacac ctcgtaataa ccttaatccc 
gaaagtcccg ctcccaccac tgtggtactt 
actaactcag gggcgcagct tgcgggcggc 
ggtataactc acctgacaat cagagggcga 
tcctcgcttg gtctccgtcc ggacgggaca 
ttcacgcctc gtcaggcaat cctaactctg 
ggcattggaa ctctgcaatt tattgaggag 
tcgggacctc ccggccacta tccggatcaa 
tcggcggacg gctacgactg aatgttaagt 
ctggtccact gtcgccgcca caagtgcttt 
gaattgcccg aggatcatat cgagggcccg 
gagcttgccc gtagcctgat tcgggagttt 
aggggaccct gtgttctcac tgtgatttgc 
ttaattaatt gccacatcct cttacacttt 
ttgtgttatg tttcaacgtg tttatttttc 
tcagtagtat agccccacca ccacatagct 
acagaaccct agtattcaac ctgccacctc 
ctccccggct ggccttaaaa agcatcatat 
tattccacac ggtttcctgt cgagccaaac 
gcagctcact taagttcatg tcgctgtcca 
gcggttgctt aacgggcggc gaaggagaag 
cgtgcatcag gatagggcgg tggtgctgca 
gctccgtcct gcaggaatac aacatggcag 
gcagcataag gcgccttgtc ctccgggcac 
cacagtaact gcagcacagc accacaatat 
atccaaagct catggcgggg accacagaac 
agattaagtg gcgacccctc ataaacacgc 
tgtaattcac cacctcccgg taccatataa 
ccatcctaaa ccagctggcc aaaacctgcc 
tggaacaatg acagtggaga gcccaggact 
tatcaatgtt ggcacaacac aggcacacgt 
cccgcgttag aaccatatcc cagggaacaa 
tgcagggaag acctcgcacg taactcacgt 
gcagcggatg atcctccagt atggtagcgc 
ccctactgta cggagtgcgc cgagacaacc 
atggaacgcc ggacgtagtc atatttcctg 
gatctgcgtc tccggtctcg ccgcttagat 
tctctcaaag catccaggcg ccccctggct 
gctgccctga taacatccac caccgcagaa 
ttctgcgagt cacacacggg aggagcggga 
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ccgccaagag tttctgctac gaaagggacg 27240 
ggagctcaac ccaatccccc cgccgccgca 27300 
ttcccaggat ggcacccaaa aagaagctgc 27360 
aatactggga cagtcaggca gaggaggttt 27420 
actgggagag cctagacgag gaagcttccg 27480 
caccctcggt cgcattcccc tcgccggcgc 27540 
ctacaacctc cgctcctcag gcgccgccgg 27600 
gggacaccac tggaaccagg gccggtaagt 27 660 
aacaacagcg ccaaggctac cgctcatggc 27720 
tgcaagactg tgggggcaac atctccttcg 27780 
tggccttccc ccgtaacatc ctgcattact 27840 
gcggcagcgg cagcggcagc aacagcagcg 27900 
aagactctga caaagcccaa gaaatccaca 27 960 
cgtctggcgc ccaacgaacc cgtatcgacc 28020 
ctgtatgcta tatttcaaca gagcaggggc 28 080 
tctctgcgat ccctcacccg cagctgcctg 28140 
acgctggaag acgcggaggc tctcttcagt 28200 
tttcgcgccc tttctcaaat ttaagcgcga 28260 
gcgccagcac ctgtcgtcag cgccattatg 28320 
agttaccagc cacaaatggg acttgcggct 28380 
aactacatga gcgcgggacc ccacatgata 28440 
aaccgaattc tcttggaaca ggcggctatt 28500 
cgtagttggc ccgctgccct cfgtgtaccag 28560 
cccagagacg cccaggccga agttcagatg 28 620 
tttcgtcaca gggtgcggtc gcccgggcag 28 680 
ggtattcagc tcaacgacga gtcggtgagc 28740 
tttcagatcg gcggcgccgg ccgtccttca 28800 
cagacctcgt cctctgagcc gcgctctgga 288 60 
tttgtgccat cggtctactt taaccccttc 28920 
tttattccta actttgacgc ggtaaaggac 28980 
ggagaggcag agcaactgcg cctgaaacac 29040 
gcccgcgact ccggtgagtt ttgctacttt 29100 
gcgcacggcg tccggcttac cgcccaggga 29160 
acccagcgcc ccctgctagt tgagcgggac 29220 
aactgtccta accttggatt acatcaagat 29280 
ttcatacatt gcccaagaat aaagaatcgt 29340 
aattgcagaa aatttcaagt catttttcat 29400 
tatacagatc accgtacctt aatcaaactc 294 60 
cctcccaaca cacagagtac acagtccttt 29520 
catgggtaac agacatattc ttaggtgtta 29580 
gctcatcagt gatattaata aactccccgg 29640 
gctgctgagc cacaggctgc tgtccaactt 29700 
tccacgccta catgggggta gagtcataat 2 97 60 
gcagcgcgcg aataaactgc tgccgccgcc 29820 
tggtctcctc agcgatgatt cgcaccgccc 2 98 80 
agcagcgcac cctgatctca cttaaatcag 29940 
tgttcaaaat cccacagtgc aaggcgctgt 30000 
ccacgtggcc atcataccac aagcgcaggt 30060 
tggacataaa cattacctct tttggcatgt 30120 
acctctgatt aaacatggcg ccatccacca 30180 
cgccggctat acactgcagg gaaccgggac 30240 
cgtaaccatg gatcatcatg ctcgtcatga 30300 
gcatacactt cctcaggatt acaagctcct 30360 
cccattcctg aatcagcgta aatcccacac 30420 
tgtgcattgt caaagtgtta cattcgggca 30480 
gggtttctgt ctcaaaagga ggtagacgat 3054 0 
gagatcgtgt tggtcgtagt gtcatgccaa 30600 
aagcaaaacc aggtgcgggc gtgacaaaca 30660 
cgctctgtgt agtagttgta gtatatccac 30720 
tcgggttcta tgtaaactcc ttcatgcgcc 30780 
taagccacac ccagccaacc tacacattcg 30840 
agagctggaa gaaccatgtt ttttttttta 30900 
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ttccaaaaga ttatccaaaa cctcaaaatg aagatctatt 
ggtggcgtgg tcaaactcta cagccaaaga acagataatg 
aatggcttcc aaaaggcaaa cggccctcac gtccaagtgg 
agggtgaatc tcctctataa acattccagc accttcaacc 
tcgccacctt ctcaatatat ctctaagcaa atcccgaata 
aatctgctcc agagcgccct ccaccttcag cctcaagcag 
tcaggttcct cacagacctg tataagattc aaaagcggaa 
tcccgtaggt cccttcgcag ggccagctga acataatcgt 
gcggccactt ccccgccagg aaccttgaca aaagaaccca 
ctcggagcta tgctaaccag cgtagccccg atgtaagctt 
aaaatgcaag gtgctgctca aaaaatcagg caaagcctcg 
gtagtcatgc tcatgcagat aaaggcaggt aagctccgga 
catttttctc tcaaacatgt ctgcgggttt ctgcataaac 
aacatttaaa cattagaagc ctgtcttaca acaggaaaaa 
cggactacgg ccatgccggc gtgaccgtaa aaaaactggt 
accgacagct cctcggtcat gtccggagtc ataatgtaag 
tgattcatcg gtcagtgcta aaaagcgacc gaaatagccc 
gcgtagagac aacattacag cccccatagg aggtataaca 
cacataaaca cctgaaaaac cctcctgcct aggcaaaata 
aacatacagc gcttcacagc ggcagcctaa cagtcagcct 
tattaaaaaa acaccactcg acacggcacc agctcaatca 
caagtgcaga gcgagtatat ataggactaa aaaatgacgt 
aacacccaga aaaccgcacg cgaacctacg cccagaaacg 
ttcctcaaat cgtcacttcc gttttcccac gttacgtaac 
caattcccaa cacatacaag ttactccgcc ctaaaaccta 
cgccccgcgc cacgtcacaa actccacccc ctcattatca 
aaggtatatt attgatgatg 



<210> 28 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 28 

ctcaacaatt gtggatccgt actcc 



<210> 29 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 29 

gtgctcagca gatcttgcga ctgtg 



<210> 30 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



aagtgaacgc gctcccctcc 30960 
gcatttgtaa gatgttgcac 31020 
acgtaaaggc taaacccttc 31080 
atgcccaaat aattctcatc 31140 
ttaagtccgg ccattgtaaa 31200 
cgaatcatga ttgcaaaaat 31260 
cattaacaaa aataccgcga 31320 
gcaggtctgc acggaccagc 31380 
cactgattat gacacgcata 3144 0 
tgttgcatgg gcggcgatat 31500 
cgcaaaaaag aaagcacatc 31560 
accaccacag aaaaagacac 31620 
acaaaataaa ataacaaaaa 31680 
caacccttat aagcataaga 31740 
caccgtgatt aaaaagcacc 31800 
actcggtaaa cacatcaggt 318 60 
gggggaatac atacccgcag 31920 
aaattaatag gagagaaaaa 31980 
gcaccctccc gctccagaac 3204 0 
taccagtaaa aaagaaaacc 32100 
gtcacagtgt aaaaaagggc 32160 
aacggttaaa gtccacaaaa 32220 
aaagccaaaa aacccacaac 32280 
ttcccatttt aagaaaacta 32340 
cgtcacccgc cccgttccca 32400 
tattggcttc aatccaaaat 32460 



<400> 30 
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ggcgcgttcg gatccactct cttcc 



25 



<210> 31 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 31 

ctacatgcta ggcagatctc gttcggag 



28 



<210> 32 
<211> 1240 
<212> DNA 
<213> adenovirus 



<400> 32 

ggatccactc 

ctgaaaagcg 

gatattcacc 

gacaatcttt 

cttggcgatg 

gtttagctgc 

gtcgggcacc 

ggctacctct 

gaatggcggt 

cccgggcagc 

ctgccatgcg 

ggggtgggtg 

gagtattcca 

gtatagttcg 

tgctcggaag 

gaagacgttg 

gtcgcgcagc 

ggtttccttg 

gacaaactct 

agatccgtac 

t ctcgagaaa 



tcttccgcat 
ggcatgactt 
tggcccgcgg 
ttgttgtcaa 
gagcgcaggg 
acgtattcgc 
aggtgcacgc 
ccgcgtaggc 
agggggtcta 
aggcgcgcgt 
cgggcggcaa 
agcgcggagg 
agatatgtag 
tgcgagggag 
actatctgcc 
aagctggcgt 
ttgttgacca 
atgatgtcat 
tcgcggtctt 
tccgccgccg 
ggcgtctaac 



cgctgtctgc 
ctgcgctaag 
tgatgccttt 
gcttggtggc 
tttggttttt 
gcgcaacgca 
gccaaccgcg 
gctcgttggt 
gctgcgtctc 
cgaagtagtc 
gcgcgcgctc 
cgtacatgcc 
ggtagcatct 
cgaggaggtc 
tgaagatggc 
ctgtgagacc 
gctcggcggt 
acttatcctg 
tccagtactc 
agggacctga 
cagtcacagt 



gagggccagc 
attgtcagtt 
gagggtggcc 
aaacgacccg 
gtcgcgatcg 
ccgccattcg 
gttgtgcagg 
ccagcagagg 
gtccgggggg 
tatcttgcat 
gtatgggttg 
gcaaatgtcg 
tccaccgcgg 
gggaccgagg 
atgtgagttg 
taccgcgtca 
gacctgcacg 
tccctttttt 
ttggatcgga 
gcgagtccgc 
cgcaagatct 



tgttggggtg 
tccaaaaacg 
gcatccatct 
tagagggcgt 
gcgcgctcct 
ggaaagacgg 
gtgacaaggt 
cggccgccct 
tctgcgtcca 
ccttgcaagt 
agtgggggac 
taaacgtaga 
atgctggcgc 
ttgctacggg 
gatgatatgg 
cgcacgaagg 
tctagggcgc 
ttccacagct 
aacccgtcgg 
atcgaccgga 



agtactccct 
aggaggattt 
ggtcagaaaa 
tggacagcaa 
tggccgcgat 
tggtgcgctc 
caacgctggt 
tgcgcgagca 
cggtaaagac 
ctagcgcctg 
cccatggcat 
ggggctctct 
gcacgtaatc 
cgggctgctc 
ttggacgctg 
aggcgtagga 
agtagtccag 
cgcggttgag 
cctccgaacg 
tcggaaaacc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1240 



<210> 33 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 33 

ggcgcgttcg gatccactct cttcc 



25 



<210> 34 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Description of Artificial Sequence: primer 
<400> 34 

gggagtagat ctcccaacag 

<210> 35 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 35 

cccttttttt tggatccctc gcgg 



<210> 36 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 36 

ctacatgcta ggcagatctc gttcggag 



<210> 37 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 

* <400> 37 
ctcaacaatt gttggatccg tactcc 



<210> 38 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 38 

gtgctcagca gatcttgcga ctgtg 



<210> 39 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
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<400> 39 

ggcgcgttcg gatccactct cttcc 



<210> 40 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 40 

ctacatgcta ggcagatctc gttcggag 



<210> 41 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 41 

cccttttttt tggatccctc gcgg 



<210> 42 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 42 

gtgctcagca gatcttgcga ctgtg 



<210> 43 
<211> 8383 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmxd 



25 



28 



24 



25 



60 



<400> 43 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 24 0 
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 54 0 
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 
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gtaggcgtgt acggtgggag gtctatataa 
ctgcttactg gcttatcgaa attaatacga 
gagctcggat ccactctctt ccgcatcgct 
ctccctctga aaagcgggca tgacttctgc 
ggatttgata ttcacctggc ccgcggtgat 
agaaaagaca atctttttgt tgtcaagctt 
cagcaacttg gcgatggagc gcagggtttg 
cgcgatgttt agctgcacgt attcgcgcgc 
gcgctcgtcg ggcaccaggt gcacgcgcca 
gctggtggct acctctccgc gtaggcgctc 
cgagcagaat ggcggtaggg ggtctagctg 
aaagaccccg ggcagcaggc gcgcgtcgaa 
cgcctgctgc catgcgcggg cggcaagcgc 
tggcatgggg tgggtgagcg cggaggcgta 
ctctctgagt attccaagat atgtagggta 
gtaatcgtat agttcgtgcg agggagcgag 
ctgctctgct cggaagacta tctgcctgaa 
acgctggaag acgttgaagc tggcgtctgt 
gtaggagtcg cgcagcttgt tgaccagctc 
gtccagggtt tccttgatga tgtcatactt 
gttgaggaca aactcttcgc ggtctttcca 
cgaacgagat ccgtactccg ccgccgaggg 
aaaacctctc gagaaaggcg tctaaccagt 
aagaccgtct gaagatacct tcaaccccgt 
aactgtgcct tttcttactc ctccctttgt 
tggggtactc tctttgcgcc tatccgaacc 
caaaatgggc aacggcctct ctctggacga 
cactgtgagc ccacctctca aaaaaaccaa 
cctcacagtt acctcagaag ccctaactgt 
caacacactc accatgcaat cacaggcccc 
tgccacccaa ggacccctca cagtgtcaga 
cctcaccacc accgatagca gtacccttac 
cactggtagc ttgggcattg acttgaaaga 
actaaagtac ggggctcctt tgcatgtaac 
tggtccaggt gtgactatta ataatacttc 
ttttgattca caaggcaata tgcaacttaa 
aaacagacgc cttatacttg atgttagtta 
aagactagga cagggccctc tttttataaa 
caaaggcctt tacttgttta cagcttcaaa 
cactgccaag gggttgatgt ttgacgctac 
tgaatttggt tcacctaatg caccaaacac 
cctagaattt gattcaaaca aggctatggt 
cagcacaggt gccattacag taggaaacaa 
accagctcca tctcctaact gtagactaaa 
cttaacaaaa tgtggcagtc aaatacttgc 
tttggctcca atatctggaa cagttcaaag 
tggagtgcta ctaaacaatt ccttcctgga 
tcttactgaa ggcacagcct atacaaacgc 
tccaaaatct cacggtaaaa ctgccaaaag 
agacaaaact aaacctgtaa cactaaccat 
cacaactcca agtgcatact ctatgtcatt 
taatgaaata tttgccacat cctcttacac 
gcggccgctc gagcatgcat ctagagggcc 
ctcgctgatc agcctcgact gtgccttcta 
ccgtgccttc cttgaccctg gaaggtgcca 
aaattgcatc gcattgtctg agtaggtgtc 
acagcaaggg ggaggattgg gaagacaata 
tggcttctga ggcggaaaga accagctggg 
gcggcgcatt aagcgcggcg ggtgtggtgg 
gcgccctagc gcccgctcct ttcgctttct 
ttccccgtca agctctaaat cggggcatcc 
acctcgaccc caaaaaactt gattagggtg 



gcagagctct ctggctaact agagaaccca 840 
ctcactatag ggagacccaa gcttggtacc 900 
gtctgcgagg gccagctgtt ggggtgagta 9 60 
gctaagattg tcagtttcca aaaacgagga 1020 
gcctttgagg gtggccgcat ccatctggtc 1080 
ggtggcaaac gacccgtaga gggcgttgga 114 0 
gtttttgtcg cgatcggcgc gctccttggc 1200 
aacgcaccgc cattcgggaa agacggtggt 12 60 
accgcggttg tgcagggtga caaggtcaac 1320 
gttggtccag cagaggcggc cgcccttgcg 1380 
cgtctcgtcc ggggggtctg cgtccacggt 1440 
gtagtctatc ttgcatcctt gcaagtctag 1500 
gcgctcgtat gggttgagtg ggggacccca 1560 
catgccgcaa atgtcgtaaa cgtagagggg 1620 
gcatcttcca ccgcggatgc tggcgcgcac 1680 
gaggtcggga ccgaggttgc tacgggcggg 17 4 0 
gatggcatgt gagttggatg atatggttgg 1800 
gagacctacc gcgtcacgca cgaaggaggc 18 60 
ggcggtgacc tgcacgtcta gggcgcagta 1920 
atcctgtccc ttttttttcc acagctcgcg 1980 
gtactcttgg atcggaaacc cgtcggcctc 2040 
acctgagcga gtccgcatcg accggatcgg 2100 
cacagtcgca agatccaaga tgaagcgcgc 2160 
gtatccatat gacacggaaa ccggtcctcc 2220 
atcccccaat gggtttcaag agagtccccc 2280 
tctagttacc tccaatggca tgcttgcgct 2340 
ggccggcaac cttacctccc aaaatgtaac 2400 
gtcaaacata aacctggaaa tatctgcacc 24 60 
ggctgccgcc gcacctctaa tggtcgcggg 2520 
gctaaccgtg cacgactcca aacttagcat 2580 
aggaaagcta gccctgcaaa catcaggccc 2640 
tatcactgcc tcaccccctc taactactgc 2700 
gcccatttat acacaaaatg gaaaactagg 2760 
agacgaccta aacactttga ccgtagcaac 2820 
cttgcaaact aaagttactg gagccttggg 2880 
tgtagcagga ggactaagga ttgattctca 2940 
tccgtttgat gctcaaaacc aactaaatct 3000 
ctcagcccac aacttggata ttaactacaa 3060 
caattccaaa aagcttgagg ttaacctaag 3120 
agccatagcc attaatgcag gagatgggct 3180 
aaatcccctc aaaacaaaaa ttggccatgg 3240 
tcctaaacta ggaactggcc ttagttttga 3300 
aaataatgat aagctaactt tgtggaccac 3360 
tgcagagaaa gatgctaaac tcactttggt 3420 
tacagtttca gttttggctg ttaaaggcag 3480 
tgctcatctt attataagat ttgacgaaaa 3540 
cccagaatat tggaacttta gaaatggaga 3600 
tgttggattt atgcctaacc tatcagctta 3660 
taacattgtc agtcaagttt acttaaacgg 3720 
tacactaaac ggtacacagg aaacaggaga 3780 
ttcatgggac tggtctggcc acaactacat 3840 
tttttcatac attgcccaag aataaaagaa 3900 
ctattctata gtgtcaccta aatgctagag 3960 
gttgccagcc atctgttgtt tgcccctccc 4020 
ctcccactgt cctttcctaa taaaatgagg 4080 
attctattct ggggggtggg gtggggcagg 414 0_ 
gcaggcatgc tggggatgcg gtgggctcta 4200 
gctctagggg gtatccccac gcgccctgta 4260 
ttacgcgcag cgtgaccgct acacttgcca 4320 
tcccttcctt tctcgccacg ttcgccggct 4380 
ctttagggtt ccgatttagt gctttacggc 4440 
atggttcacg tagtgggcca tcgccctgat 4500 
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agacggtttt 
aaactggaac 
ggatttcggc 
tctgtggaat 
gtatgcaaag 
cagcaggcag 
taactccgcc 
gactaatttt 
agtagtgagg 
tatccatttt 
atggattgca 
cacaacagac 
cggttctttt 
cgcggctatc 
ctgaagcggg 
ctcaccttgc 
cgcttgatcc 
gtactcggat 
tcgcgccagc 
tcgtgaccca 
gattcatcga 
cccgtgatat 
gtatcgccgc 
gagcgggact 
tttcgattcc 
cggctggatg 
gtttattgca 
agcatttttt 
tgtctgtata 
tgtgtgaaat 
taaagcctgg 
cgctttccag 
gagaggcggt 
ggtcgttcgg 
agaatcaggg 
ccgtaaaaag 
caaaaatcga 
gtttccccct 
cctgtccgcc 
tctcagttcg 
gcccgaccgc 
cttatcgcca 
tgctacagag 
tatctgcgct 
caaacaaacc 
aaaaaaagga 
cgaaaactca 
ccttttaaat 
tgacagttac 
atccatagtt 
tggccccagt 
aataaaccag 
catccagtct 
gcgcaacgtt 
ttcattcagc 
aaaagcggtt 
atcactcatg 
cttttctgtg 
gagttgctct 
agtgctcatc 
gagatccagt 
caccagcgtt 



tcgccctttg 
aacactcaac 
ctattggtta 
gtgtgtcagt 
catgcatctc 
aagtatgcaa 
catcccgccc 
ttttatttat 
aggctttttt 
cggatctgat 
cgcaggttct 
aatcggctgc 
tgtcaagacc 
gtggctggcc 
aagggactgg 
tcctgccgag 
ggctacctgc 
ggaagccggt 
cgaactgttc 
tggcgatgcc 
ctgtggccgg 
tgctgaagag 
tcccgattcg 
ctggggttcg 
accgccgcct 
atcctccagc 
gcttataatg 
tcactgcatt 
ccgtcgacct 
tgttatccgc 
ggtgcctaat 
tcgggaaacc 
ttgcgtattg 
ctgcggcgag 
gataacgcag 
gccgcgttgc 
cgctcaagtc 
ggaagctccc 
tttctccctt 
gtgtaggtcg 
tgcgccttat 
ctggcagcag 
ttcttgaagt 
ctgctgaagc 
accgctggta 
tctcaagaag 
cgttaaggga 
taaaaatgaa 
caatgcttaa 
gcctgactcc 
gctgcaatga 
ccagccggaa 
attaattgtt 
gttgccattg 
tccggttccc 
agctccttcg 
gttatggcag 
actggtgagt 
tgcccggcgt 
attggaaaac 
tcgatgtaac 
tctgggtgag 



acgttggagt 
cctatctcgg 
aaaaatgagc 
tagggtgtgg 
aattagtcag 
agcatgcatc 
ctaactccgc 
gcagaggccg 
ggaggcctag 
caagagacag 
ccggccgctt 
tctgatgccg 
gacctgtccg 
acgacgggcg 
ctgctattgg 
aaagtatcca 
ccattcgacc 
cttgtcgatc 
gccaggctca 
tgcttgccga 
ctgggtgtgg 
cttggcggcg 
cagcgcatcg 
aaatgaccga 
tctatgaaag 
gcggggatct 
gttacaaata 
ctagttgtgg 
ctagctagag 
tcacaattcc 
gagtgagcta 
tgtcgtgcca 
ggcgctcttc 
cggtatcagc 
gaaagaacat 
tggcgttttt 
agaggtggcg 
tcgtgcgctc 
cgggaagcgt 
ttcgctccaa 
ccggtaacta 
ccactggtaa 
ggtggcctaa 
cagttacctt 
gcggtggttt 
atcctttgat 
ttttggtcat 
gttttaaatc 
tcagtgaggc 
ccgtcgtgta 
taccgcgaga 
gggccgagcg 
gccgggaagc 
ctacaggcat 
aacgatcaag 
gtcctccgat 
cactgcataa 
actcaaccaa 
caatacggga 
gttcttcggg 
ccactcgtgc 
caaaaacagg 



ccacgttctt 
tctattcttt 
tgatttaaca 
aaagtcccca 
caaccaggtg 
tcaattagtc 
ccagttccgc 
aggccgcctc 
gcttttgcaa 
gatgaggatc 
gggtggagag 
ccgtgttccg 
gtgccctgaa 
ttccttgcgc 
gcgaagtgcc 
tcatggctga 
accaagcgaa 
aggatgatct 
aggcgcgcat 
atatcatggt 
cggaccgcta 
aatgggctga 
ccttctatcg 
ccaagcgacg 
gttgggcttc 
catgctggag 
aagcaatagc 
tttgtccaaa 
cttggcgtaa 
acacaacata 
actcacatta 
gctgcattaa 
cgcttcctcg 
tcactcaaag 
gtgagcaaaa 
ccataggctc 
aaacccgaca 
tcctgttccg 
ggcgctttct 
gctgggctgt 
tcgtcttgag 
caggattagc 
ctacggctac 
cggaaaaaga 
ttttgtttgc 
cttttctacg 
gagattatca 
aatctaaagt 
acctatctca 
gataactacg 
cccacgctca 
cagaagtggt 
tagagtaagt 
cgtggtgtca 
gcgagttaca 
cgttgtcaga 
ttctcttact 
gtcattctga 
taataccgcg 
gcgaaaactc 
acccaactga 
aaggcaaaat 



taatagtgga 

tgatttataa 

aaaatttaac 

ggctccccag 

tggaaagtcc 

agcaaccata 

ccattctccg 

tgcctctgag 

aaagctcccg 

gtttcgcatg 

gctattcggc 

gctgtcagcg 

tgaactgcag 

agctgtgctc 

ggggcaggat 

tgcaatgcgg 

acatcgcatc 

ggacgaagag 

gcccgacggc 

ggaaaatggc 

tcaggacata 

ccgcttcctc 

ccttcttgac 

cccaacctgc 

ggaatcgttt 

ttcttcgccc 

atcacaaatt 

ctcatcaatg 

tcatggtcat 

cgagccggaa 

attgcgttgc 

tgaatcggcc 

ctcactgact 

gcggtaatac 

ggccagcaaa 

cgcccccctg 

ggactataaa 

accctgccgc 

caatgctcac 

gtgcacgaac 

tccaacccgg 

agagcgaggt 

actagaagga 

gttggtagct 

aagcagcaga 

gggtctgacg 

aaaaggatct 

atatatgagt 

gcgatctgtc 

atacgggagg 

ccggctccag 

cctgcaactt 

agttcgccag 

cgctcgtcgt 

tgatccccca 

agtaagttgg 

gtcatgccat 

gaatagtgta 

ccacatagca 

tcaaggatct 

tcttcagcat 

gccgcaaaaa 



ctcttgttcc 

gggattttgg 

gcgaattaat 

gcaggcagaa 

ccaggctccc 

gtcccgcccc 

ccccatggct 

ctattccaga 

ggagcttgta 

attgaacaag 

tatgactggg 

caggggcgcc 

gacgaggcag 

gacgttgtca 

ctcctgtcat 

cggctgcata 

gagcgagcac 

catcaggggc 

gaggatctcg 

cgcttttctg 

gcgttggcta 

gtgctttacg 

gagttcttct 

catcacgaga 

tccgggacgc 

accccaactt 

tcacaaataa 

tatcttatca 

agctgtttcc 

gcataaagtg 

gctcactgcc 

aacgcgcggg 

cgctgcgctc 

ggttatccac 

aggccaggaa 

acgagcatca 

gataccaggc 

ttaccggata 

gctgtaggta 

cccccgttca 

taagacacga 

atgtaggcgg 

cagtatttgg 

cttgatccgg 

ttacgcgcag 

ctcagtggaa 

tcacctagat 

aaacttggtc 

tatttcgttc 

gcttaccatc 

atttatcagc 

tatccgcctc 

ttaatagttt 

ttggtatggc 

tgttgtgcaa 

ccgcagtgtt 

ccgtaagatg 

tgcggcgacc 

gaactttaaa 

taccgctgtt 

cttttacttt 

agggaataag 



4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7 920" 

7980 

8040 

8100 

8160 

8220 



-37- 



ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta 8280 
tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat 8340 
aggggttccg cgcacatttc cccgaaaagt gccacctgac gtc 8383 



<210> 44 
<211> 7960 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasraid 
<400> 44 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 

ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 

cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 

ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 

gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 

tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 

cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 

attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 

atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 54 0 

atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 

tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 

actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 

aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 

gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 84 0 

ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagc 900 

gtttaaactt aagcttggta ccgagctcgg atccactctc ttccgcatcg ctgtctgcga 960 

gggccagctg ttggggtgag tactccctct gaaaagcggg catgacttct gcgctaagat 1020 

tgtcagtttc caaaaacgag gaggatttga tattcacctg gcccgcggtg atgcctttga 1080 

gggtggccgc atccatctgg tcagaaaaga caatcttttt gttgtcaagc ttggtggcaa 1140 

acgacccgta gagggcgttg gacagcaact tggcgatgga gcgcagggtt tggtttttgt 1200 

cgcgatcggc gcgctccttg gccgcgatgt ttagctgcac gtattcgcgc gcaacgcacc 12 60 

gccattcggg aaagacggtg gtgcgctcgt cgggcaccag gtgcacgcgc caaccgcggt 1320 

tgtgcagggt gacaaggtca acgctggtgg ctacctctcc gcgtaggcgc tcgttggtcc 1380 

agcagaggcg gccgcccttg cgcgagcaga atggcggtag ggggtctagc tgcgtctcgt 14 4 0 

ccggggggtc tgcgtccacg gtaaagaccc cgggcagcag gcgcgcgtcg aagtagtcta 1500 

tcttgcatcc ttgcaagtct agcgcctgct gccatgcgcg ggcggcaagc gcgcgctcgt 15 60 

atgggttgag tgggggaccc catggcatgg ggtgggtgag cgcggaggcg tacatgccgc 1620 

aaatgtcgta aacgtagagg ggctctctga gtattccaag atatgtaggg tagcatcttc 1680 

caccgcggat gctggcgcgc acgtaatcgt atagttcgtg cgagggagcg aggaggtcgg 17 4 0 

gaccgaggtt gctacgggcg ggctgctctg ctcggaagac tatctgcctg aagatggcat 1800 

gtgagttgga tgatatggtt ggacgctgga agacgttgaa gctggcgtct gtgagaccta 18 60 

ccgcgtcacg cacgaaggag gcgtaggagt cgcgcagctt gttgaccagc tcggcggtga 1920 

cctgcacgtc tagggcgcag tagtccaggg tttccttgat gatgtcatac ttatcctgtc 1980 

cctttttttt ccacagctcg cggttgagga caaactcttc gcggtctttc cagtactctt 2040 

ggatcggaaa cccgtcggcc tccgaacgag atccgtactc cgccgccgag ggacctgagc 2100 

gagtccgcat cgaccggatc ggaaaacctc tcgagaaagg cgtctaacca gtcacagtcg 2160 

caagatccaa gatgaagcgc gcaagaccgt ctgaagatac cttcaacccc gtgtatccat 2220 

atgacacgga aaccggtcct ccaactgtgc cttttcttac tcctcccttt gtatccccca 2280 

atgggtttca agagagtccc cctggggtac tctctttgcg cctatccgaa cctctagtta 2340 

cctccaatgg catgcttgcg ctcaaaatgg gcaacggcct ctctctggac gaggccggca 2400 

accttacctc ccaaaatgta accactgtga gcccacctct caaaaaaacc aagtcaaaca 24 60 

taaacctgga aatatctgca cccctcacag ttacctcaga agccctaact gtggctgccg 2520 

ccgcacctct aatggtcgcg ggcaacacac tcaccatgca atcacaggcc ccgctaaccg 2580 

tgcacgactc caaacttagc attgccaccc aaggacccct cacagtgtca gaaggaaagc 2 64 0 

tagccctgca aacatcaggc cccctcacca ccaccgatag cagtaccctt actatcactg 2700 

cctcaccccc tctaactact gccactggta gcttgggcat tgacttgaaa gagcccattt 27 60 

atacacaaaa tggaaaacta ggactaaagt acggggctcc tttgcatgta acagacgacc 2820 

taaacacttt gaccgtagca actggtccag gtgtgactat taataatact tccttgcaaa 2880 



-38- 



ctaaagttac 
gaggactaag 
atgctcaaaa 
acaacttgga 
aaaagcttga 
ccattaatgc 
tcaaaacaaa 
taggaactgg 
ataagctaac 
aagatgctaa 
cagttttggc 
ttattataag 
attggaactt 
ttatgcctaa 
tcagtcaagt 
acggtacaca 
actggtctgg 
acattgccca 
gatcagcctc 
cttccttgac 
catcgcattg 
agggggagga 
ctgaggcgga 
cattaagcgc 
tagcgcccgc 
gtcaagctct 
accccaaaaa 
tttttcgccc 
gaacaacact 
cggcctattg 
gaatgtgtgt 
aaagcatgca 
gcagaagtat 
cgcccatccc 
ttttttttat 
gaggaggctt 
ttttcggatc 
ataatacgac 
caccgcgcgc 
ggacttcgtg 
cgcggtccag 
ggacgagctg 
gccggccatg 
ggccggcaac 
cgattccacc 
ctggatgatc 
tattgcagct 
atttttttca 
ctgtataccg 
gtgaaattgt 
agcctggggt 
tttccagtcg 
aggcggtttg 
cgttcggctg 
atcaggggat 
taaaaaggcc 
aaatcgacgc 
tccccctgga 
gtccgccttt 
cagttcggtg 
cgaccgctgc 
atcgccactg 



tggagccttg 
gattgattct 
ccaactaaat 
tattaactac 
ggttaaccta 
aggagatggg 
aattggccat 
ccttagtttt 
tttgtggacc 
actcactttg 
tgttaaaggc 
atttgacgaa 
tagaaatgga 
cctatcagct 
ttacttaaac 
ggaaacagga 
ccacaactac 
agaataaaag 
gactgtgcct 
cctggaaggt 
tctgagtagg 
ttgggaagac 
aagaaccagc 
ggcgggtgtg 
tcctttcgct 
aaatcggggc 
acttgattag 
tttgacgttg 
caaccctatc 
gttaaaaaat 
cagttagggt 
tctcaattag 
gcaaagcatg 
gcccctaact 
ttatgcagag 
ttttggaggc 
tgatcagcac 
aaggtgagga 
gacgtcgccg 
gaggacgact 
gaccaggtgg 
tacgccgagt 
accgagatcg 
tgcgtgcact 
gccgccttct 
ctccagcgcg 
tataatggtt 
ctgcattcta 
tcgacctcta 
tatccgctca 
gcctaatgag 
ggaaacctgt 
cgtattgggc 
cggcgagcgg 
aacgcaggaa 
gcgttgctgg 
tcaagtcaga 
agctccctcg 
ctcccttcgg 
taggtcgttc 
gccttatccg 
gcagcagcca 



ggttttgatt 
caaaacagac 
ctaagactag 
aacaaaggcc 
agcactgcca 
cttgaatttg 
ggcctagaat 
gacagcacag 
acaccagctc 
gtcttaacaa 
agtttggctc 
aatggagtgc 
gatcttactg 
tatccaaaat 
ggagacaaaa 
gacacaactc 
attaatgaaa 
aagcggccgc 
tctagttgcc 
gccactccca 
tgtcattcta 
aatagcaggc 
tggggctcta 
gtggttacgc 
ttcttccctt 
atccctttag 
ggtgatggtt 
gagtccacgt 
tcggtctatt 
gagctgattt 
gtggaaagtc 
tcagcaacca 
catctcaatt 
ccgcccagtt 
gccgaggccg 
ctaggctttt 
gtgttgacaa 
actaaaccat 
gagcggtcga 
tcgccggtgt 
tgccggacaa 
ggtcggaggt 
gcgagcagcc 
tcgtggccga 
atgaaaggtt 
gggatctcat 
acaaataaag 
gttgtggttt 
gctagagctt 
caattccaca 
tgagctaact 
cgtgccagct 
gctcttccgc 
tatcagctca 
agaacatgtg 
cgtttttcca 
ggtggcgaaa 
tgcgctctcc 
gaagcgtggc 
gctccaagct 
gtaactatcg 
ctggtaacag 



cacaaggcaa 
gccttatact 
gacagggccc 
tttacttgtt 
aggggttgat 
gttcacctaa 
ttgattcaaa 
gtgccattac 
catctcctaa 
aatgtggcag 
caatatctgg 
tactaaacaa 
aaggcacagc 
ctcacggtaa 
ctaaacctgt 
caagtgcata 
tatttgccac 
tcgagtctag 
agccatctgt 
ctgtcctttc 
ttctgggggg 
atgctgggga 
gggggtatcc 
gcagcgtgac 
cctttctcgc 
ggttccgatt 
cacgtagtgg 
tctttaatag 
cttttgattt 
aacaaaaatt 
cccaggct cc 
ggtgtggaaa 
agtcagcaac 
ccgcccattc 
cctctgcctc 
gcaaaaagct 
ttaatcatcg 
ggccaagttg 
gttctggacc 
ggtccgggac 
caccctggcc 
cgtgtccacg 
gtgggggcgg 
ggagcaggac 
gggcttcgga 
gctggagttc 
caatagcatc 
gtccaaactc 
ggcgtaatca 
caacatacga 
cacattaatt 
gcattaatga 
ttcctcgctc 
ctcaaaggcg 
agcaaaaggc 
taggctccgc 
cccgacagga 
tgttccgacc 
gctttctcaa 
gggctgtgtg 
tcttgagtcc 
gattagcaga 



tatgcaactt 
tgatgttagt 
tctttttata 
tacagcttca 
gtttgacgct 
tgcaccaaac 
caaggctatg 
agtaggaaac 
ctgtagacta 
tcaaatactt 
aacagttcaa 
ttccttcctg 
ctatacaaac 
aactgccaaa 
aacactaacc 
ctctatgtca 
atcctcttac 
agggcccgtt 
tgtttgcccc 
ctaataaaat 
tggggtgggg 
tgcggtgggc 
ccacgcgccc 
cgctacactt 
cacgttcgcc 
tagtgcttta 
gccatcgccc 
tggactcttg 
ataagggatt 
taacgcgaat 
ccaggcaggc 
gtccccaggc 
catagtcccg 
tccgccccat 
tgagctattc 
cccgggagct 
gcatagtata 
accagtgccg 
gaccggctcg 
gacgtgaccc 
tgggtgtggg 
aacttccggg 
gagttcgccc 
tgacacgtgc 
atcgttttcc 
ttcgcccacc 
acaaatttca 
atcaatgtat 
tggtcatagc 
gccggaagca 
gcgttgcgct 
atcggccaac 
actgactcgc 
gtaatacggt 
cagcaaaagg 
ccccctgacg 
ctataaagat 
ctgccgctta 
tgctcacgct 
cacgaacccc 
aacccggtaa 
gcgaggtatg 



aatgtagcag 
tatccgtttg 
aactcagccc 
aacaattcca 
acagccatag 
acaaatcccc 
gttcctaaac 
aaaaataatg 
aatgcagaga 
gctacagttt 
agtgctcatc 
gacccagaat 
gctgttggat 
agtaacattg 
attacactaa 
ttttcatggg 
actttttcat 
taaacccgct 
tcccccgtgc 
gaggaaattg 
caggacagca 
tctatggctt 
tgtagcggcg 
gccagcgccc 
ggctttcccc 
cggcacctcg 
tgatagacgg 
ttccaaactg 
ttggggattt 
taattctgtg 
agaagtatgc 
tccccagcag 
cccctaactc 
ggctgactaa 
cagaagtagt 
tgtatatcca 
tcggcatagt 
ttccggtgct 
ggttctcccg 
tgttcatcag 
tgcgcggcct 
acgcctccgg 
tgcgcgaccc 
tacgagattt 
gggacgccgg 
ccaacttgtt 
caaataaagc 
cttatcatgt 
tgtttcctgt 
taaagtgtaa 
cactgcccgc 
gcgcggggag 
tgcgctcggt 
tatccacaga 
ccaggaaccg 
agcatcacaa 
accaggcgtt 
ccggatacct 
gtaggtatct 
ccgttcagcc 
gacacgactt 
taggcggtgc 



2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300' 

6360 

6420 

6480 

6540 

6600 



-39- 



tacagagttc 
ctgcgctctg 
acaaaccacc 
aaaaggatct 
aaactcacgt 
tttaaattaa 
cagttaccaa 
catagttgcc 
ccccagtgct 
aaaccagcca 
ccagtctatt 
caacgttgtt 
attcagctcc 
agcggttagc 
actcatggtt 
ttctgtgact 
ttgctcttgc 
gctcatcatt 
atccagttcg 
cagcgtttct 
gacacggaaa 
gggttattgt 
ggttccgcgc 



ttgaagtggt 
ctgaagccag 
gctggtagcg 
caagaagatc 
taagggattt 
aaatgaagtt 
tgcttaatca 
tgactccccg 
gcaatgatac 
gccggaaggg 
aattgttgcc 
gccattgcta 
ggttcccaac 
tccttcggtc 
atggcagcac 
ggtgagtact 
ccggcgtcaa 
ggaaaacgtt 
atgtaaccca 
gggtgagcaa 
tgttgaatac 
ctcatgagcg 
acatttcccc 



ggcctaacta 
ttaccttcgg 
gtggtttttt 
ctttgatctt 
tggtcatgag 
ttaaatcaat 
gtgaggcacc 
tcgtgtagat 
cgcgagaccc 
ccgagcgcag 
gggaagctag 
caggcatcgt 
gatcaaggcg 
ctccgatcgt 
tgcataattc 
caaccaagtc 
tacgggataa 
cttcggggcg 
ctcgtgcacc 
aaacaggaag 
tcatactctt 
gatacatatt 
gaaaagtgcc 



cggctacact 
aaaaagagtt 
tgtttgcaag 
ttctacgggg 
attatcaaaa 
ctaaagtata 
tatctcagcg 
aactacgata 
acgctcaccg 
aagtggtcct 
agtaagtagt 
ggtgtcacgc 
agttacatga 
tgtcagaagt 
tcttactgtc 
attctgagaa 
taccgcgcca 
aaaactctca 
caactgatct 
gcaaaatgcc 
cctttttcaa 
tgaatgtatt 
acctgacgtc 



agaaggacag 
ggtagctctt 
cagcagatta 
tctgacgctc 
aggatcttca 
tatgagtaaa 
atctgtctat 
cgggagggct 
gctccagatt 
gcaactttat 
tcgccagtta 
tcgtcgtttg 
tcccccatgt 
aagttggccg 
atgccatccg 
tagtgtatgc 
catagcagaa 
aggatcttac 
tcagcatctt 
gcaaaaaagg 
tattattgaa 
tagaaaaata 



tatttggtat 
gatccggcaa 
cgcgcagaaa 
agtggaacga 
cctagatcct 
cttggtctga 
ttcgttcatc 
taccatctgg 
tatcagcaat 
ccgcctccat 
atagtttgcg 
gtatggcttc 
tgtgcaaaaa 
cagtgttatc 
taagatgctt 
ggcgaccgag 
ctttaaaagt 
cgctgttgag 
ttactttcac 
gaataagggc 
gcatttatca 
aacaaatagg 



6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7960 



<210> 45 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 45 

atgggatcca agatgaagcg cgcaagaccg 



30 



<210> 46 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 46 

cactatagcg gccgcattct cagtcatctt 



30 



<210> 47 
<211> 7989 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 47 " 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 
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gattattgac tagttattaa tagtaatcaa 
tggagttccg cgttacataa cttacggtaa 
cccgcccatt gacgtcaata atgacgtatg 
attgacgtca atgggtggac tatttacggt 
atcatatgcc aagtacgccc cctattgacg 
atgcccagta catgacctta tgggactttc 
tcgctattac catggtgatg cggttttggc 
actcacgggg atttccaagt ctccacccca 
aaaatcaacg ggactttcca aaatgtcgta 
gtaggcgtgt acggtgggag gtctatataa 
ctgcttactg gcttatcgaa attaatacga 
gtttaaactt aagcttggta ccgagctcgg 
gggccagctg ttggggtgag tactccctct 
tgtcagtttc caaaaacgag gaggatttga 
gggtggccgc atccatctgg tcagaaaaga 
acgacccgta gagggcgttg gacagcaact 
cgcgatcggc gcgctccttg gccgcgatgt 
gccattcggg aaagacggtg gtgcgctcgt 
tgtgcagggt gacaaggtca acgctggtgg 
agcagaggcg gccgcccttg cgcgagcaga 
ccggggggtc tgcgtccacg gtaaagaccc 
tcttgcatcc ttgcaagtct agcgcctgct 
atgggttgag tgggggaccc catggcatgg 
aaatgtcgta aacgtagagg ggctctctga 
caccgcggat gctggcgcgc acgtaatcgt 
gaccgaggtt gctacgggcg ggctgctctg 
gtgagttgga tgatatggtt ggacgctgga 
ccgcgtcacg cacgaaggag gcgtaggagt 
cctgcacgtc tagggcgcag tagtccaggg 
cctttttttt ccacagctcg cggttgagga 
ggatcggaaa cccgtcggcc tccgaacgag 
gagtccgcat cgaccggatc ggaaaacctc 
caagatccaa gatgaagcgc gcaagaccgt 
atgacacgga aaccggtcct ccaactgtgc 
atgggtttca agagagtccc cctggggtac 
cctccaatgg catgcttgcg ctcaaaatgg 
accttacctc ccaaaatgta accactgtga 
taaacctgga aatatctgca cccctcacag 
ccgcacctct aatggtcgcg ggcaacacac 
tgcacgactc caaacttagc attgccaccc 
tagccctgca aacatcaggc cccctcacca 
cctcaccccc tctaactact gccactggta 
atacacaaaa tggaaaacta ggactaaagt 
taaacacttt gaccgtagca actggtccag 
ctaaagttac tggagccttg ggttttgatt 
gaggactaag gattgattct caaaacagac 
atgctcaaaa ccaactaaat ctaagactag 
acaacttgga tattaactac aacaaaggcc 
aaaagcttga ggttaaccta agcactgcca 
ccattaatgc aggagatggg cttgaatttg 
tcaaaacaaa aattggccat ggcctagaat 
taggaactgg ccttagtttt gacagcacag 
ataagctaac tttgtggacc ggtccaaaac 
aacaaaaccc agatagcaaa ctaactttaa 
gatatgtaac gctaatggga gcctcagact 
tctccattaa tgtagaacta tactttgatg 
ctcttaaaac agatctagaa ctaaaataca 
ttatgccaag tactacagcg tatccatttg 
attatatttt tggtcaatgc tactacaaag 
ttactgttat gcttaataaa cgcctgccag 
tatggtcctt gaatgctggt ctagctccag 
catttacctt ttcctatatt agagaagatg 



ttacggggtc attagttcat agcccatata 300 
atggcccgcc tggctgaccg cccaacgacc 360 
ttcccatagt aacgccaata gggactttcc 420 
aaactgccca cttggcagta catcaagtgt 4 80 
tcaatgacgg taaatggccc gcctggcatt 54 0 
ctacttggca gtacatctac gtattagtca 600 
agtacatcaa tgggcgtgga tagcggtttg 660 
ttgacgtcaa tgggagtttg ttttggcacc 720 
acaactccgc cccattgacg caaatgggcg 780 
gcagagctct ctggctaact agagaaccca 840 
ctcactatag ggagacccaa gctggctagc 900 
atccactctc ttccgcatcg ctgtctgcga 960 
gaaaagcggg catgacttct gcgctaagat 1020 
tattcacctg gcccgcggtg atgcctttga 1080 
caatcttttt gttgtcaagc ttggtggcaa 114 0 
tggcgatgga gcgcagggtt tggtttttgt 1200 
ttagctgcac gtattcgcgc gcaacgcacc 1260 
cgggcaccag gtgcacgcgc caaccgcggt 132 0 
ctacctctcc gcgtaggcgc tcgttggtcc 1380 
atggcggtag ggggtctagc tgcgtctcgt 14 4 0 
cgggcagcag gcgcgcgtcg aagtagtcta 1500 
gccatgcgcg ggcggcaagc gcgcgctcgt 1560 
ggtgggtgag cgcggaggcg tacatgccgc 1620 
gtattccaag atatgtaggg tagcatcttc 1680 
atagttcgtg cgagggagcg aggaggtcgg 17 4 0 
ctcggaagac tatctgcctg aagatggcat 1800 
agacgttgaa gctggcgtct gtgagaccta 18 60 
cgcgcagctt gttgaccagc tcggcggtga 1920 
tttccttgat gatgtcatac ttatcctgtc 1980 
caaactcttc gcggtctttc cagtactctt 204 0 
atccgtactc cgccgccgag ggacctgagc 210 0 
tcgagaaagg cgtctaacca gtcacagtcg 2160 
ctgaagatac cttcaacccc gtgtatccat 2220 
cttttcttac tcctcccttt gtatccccca 2280 
tctctttgcg cctatccgaa cctctagtta 2340 
gcaacggcct ctctctggac gaggccggca 2400 
gcccacctct caaaaaaacc aagtcaaaca 2460 
ttacctcaga agccctaact gtggctgccg 2520 
tcaccatgca atcacaggcc ccgctaaccg 2580 
aaggacccct cacagtgtca gaaggaaagc 2 64 0 
ccaccgatag cagtaccctt actatcactg 2700 
gcttgggcat tgacttgaaa gagcccattt 27 60 
acggggctcc tttgcatgta acagacgacc 2820 
gtgtgactat taataatact tccttgcaaa 2880 
cacaaggcaa tatgcaactt aatgtagcag 2940 
gccttatact tgatgttagt tatccgtttg 3000 
gacagggccc tctttttata aactcagccc 3060 
tttacttgtt tacagcttca aacaattcca 3120 
aggggttgat gtttgacgct acagccatag ' 3180 
gttcacctaa tgcaccaaac acaaatcccc 3240 
ttgattcaaa caaggctatg gttcctaaac 3300 
gtgccattac agtaggaaac aaaaataatg 3360 
cagaagccaa ctgcataatt gaatacggga 3420 
tccttgtaaa aaatggagga attgttaatg 3480 
acgttaacac cttatttaaa aacaaaaatg 354 0 
ccactggtca tatattacca gactcatctt 3600 
agcaaaccgc tgactttagt gcaagaggtt 3660 
tccttcctaa tgcgggaaca cataatgaaa 3720 
caagcgatgg tgcccttttt ccgttggaag 3780 
atagtcgcac atcctatgtt atgacttttt 3840 
aaactactca ggcaaccctc ataacctccc 3900 
actgattttt aagaagcggc cgctcgagtc 3960 
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tagagggccc gtttaaaccc gctgatcagc 
tgttgtttgc ccctcccccg tgccttcctt 
ttcctaataa aatgaggaaa ttgcatcgca 
gggtggggtg gggcaggaca gcaaggggga 
ggatgcggtg ggctctatgg cttctgaggc 
ggggtatccc cacgcgccct gtagcggcgc 
cagcgtgacc gctacacttg ccagcgccct 
ctttctcgcc acgttcgccg gctttccccg 
gttccgattt agtgctttac ggcacctcga 
acgtagtggg ccatcgccct gatagacggt 
ctttaatagt ggactcttgt tccaaactgg 
ttttgattta taagggattt tggggatttc 
acaaaaattt aacgcgaatt aattctgtgg 
ccaggctccc caggcaggca gaagtatgca 
gtgtggaaag tccccaggct ccccagcagg 
gtcagcaacc atagtcccgc ccctaactcc 
cgcccattct ccgccccatg gctgactaat 
ctctgcctct gagctattcc agaagtagtg 
caaaaagctc ccgggagctt gtatatccat 
taatcatcgg catagtatat cggcatagta 
gccaagttga ccagtgccgt tccggtgctc 
ttctggaccg accggctcgg gttctcccgg 
gtccgggacg acgtgaccct gttcatcagc 
accctggcct gggtgtgggt gcgcggcctg 
gtgtccacga acttccggga cgcctccggg 
tgggggcggg agttcgccct gcgcgacccg 
gagcaggact gacacgtgct acgagatttc 
ggcttcggaa tcgttttccg ggacgccggc 
ctggagttct tcgcccaccc caacttgttt 
aatagcatca caaatttcac aaataaagca 
tccaaactca tcaatgtatc ttatcatgtc 
gcgtaatcat ggtcatagct gtttcctgtg 
aacatacgag ccggaagcat aaagtgtaaa 
acattaattg cgttgcgctc actgcccgct 
cattaatgaa tcggccaacg cgcggggaga 
tcctcgctca ctgactcgct gcgctcggtc 
tcaaaggcgg taatacggtt atccacagaa 
gcaaaaggcc agcaaaaggc caggaaccgt 
aggctccgcc cccctgacga gcatcacaaa 
ccgacaggac tataaagata ccaggcgttt 
gttccgaccc tgccgcttac cggatacctg 
ctttctcaat gctcacgctg taggtatctc 
ggctgtgtgc acgaaccccc cgttcagccc 
cttgagtcca acccggtaag acacgactta 
attagcagag cgaggtatgt aggcggtgct 
ggctacacta gaaggacagt atttggtatc 
aaaagagttg gtagctcttg atccggcaaa 
gtttgcaagc agcagattac gcgcagaaaa 
tctacggggt ctgacgctca gtggaacgaa 
ttatcaaaaa ggatcttcac ctagatcctt 
taaagtatat atgagtaaac ttggtctgac 
atctcagcga tctgtctatt tcgttcatcc 
actacgatac gggagggctt accatctggc 
cgctcaccgg ctccagattt atcagcaata 
agtggtcctg caactttatc cgcctccatc 
gtaagtagtt cgccagttaa tagtttgcgc 
gtgtcacgct cgtcgtttgg tatggcttca 
gttacatgat cccccatgtt gtgcaaaaaa 
gtcagaagta agttggccgc agtgttatca 
cttactgtca tgccatccgt aagatgcttt 
ttctgagaat agtgtatgcg gcgaccgagt 
accgcgccac atagcagaac tttaaaagtg 



ctcgactgtg ccttctagtt gccagccatc 4020 
gaccctggaa ggtgccactc ccactgtcct 4080 
ttgtctgagt aggtgtcatt ctattctggg 4140 
ggattgggaa gacaatagca ggcatgctgg 4200 
ggaaagaacc snccntagct ggggctctag 4260 
attaagcgcg gcgggtgtgg tggttacgcg 4320 
agcgcccgct cctttcgctt tcttcccttc 4380 
tcaagctcta aatcggggca tccctttagg 4440 
ccccaaaaaa cttgattagg gtgatggttc 4500 
ttttcgccct ttgacgttgg agtccacgtt 4560 
aacaacactc aaccctatct cggtctattc 4 620 
ggcctattgg ttaaaaaatg agctgattta 4 680 
aatgtgtgtc agttagggtg tggaaagtcc 4740 
aagcatgcat ctcaattagt cagcaaccag 4800 
cagaagtatg caaagcatgc atctcaatta 4860 
gcccatcccg cccctaactc cgcccagttc 4 920 
tttttttatt tatgcagagg ccgaggccgc 4 980 
aggaggcttt tttggaggcc taggcttttg 5040 
tttcggatct gatcagcacg tgttgacaat 5100 
taatacgaca aggtgaggaa ctaaaccatg 5160 
accgcgcgcg acgtcgccgg agcggtcgag 5220 
gacttcgtgg aggacgactt qgccggtgtg 5280 
gcggtccagg accaggtggt gccggacaac 5340 
gacgagctgt acgccgagtg gtcggaggtc 5400 
ccggccatga ccgagatcgg cgagcagccg 54 60 
gccggcaact gcgtgcactt cgtggccgag 5520 
gattccaccg ccgccttcta tgaaaggttg 5580 
tggatgatcc tccagcgcgg ggatctcatg 564 0 
attgcagctt ataatggtta caaataaagc 5700 
tttttttcac tgcattctag ttgtggtttg 5760 
tgtataccgt cgacctctag ctagagcttg 5820 
tgaaattgtt atccgctcac aattccacac 5880 
gcctggggtg cctaatgagt gagctaactc 5940 
ttccagtcgg gaaacctgtc gtgccagctg 6000 
ggcggtttgc gtattgggcg ctcttccgct 6060 
gttcggctgc ggcgagcggt atcagctcac 6120 
tcaggggata acgcaggaaa gaacatgtga 6180 
aaaaaggccg cgttgctggc gtttttccat 6240 
aatcgacgct caagtcagag gtggcgaaac 6300 
ccccctggaa gctccctcgt gcgctctcct 6360 
tccgcctttc tcccttcggg aagcgtggcg 6420 
agttcggtgt aggtcgttcg ctccaagctg 64 80 
gaccgctgcg ccttatccgg taactatcgt 6540 
tcgccactgg cagcagccac tggtaacagg 6600 
acagagttct tgaagtggtg gcctaactac 6660 
tgcgctctgc tgaagccagt taccttcgga 6720 
caaaccaccg ctggtagcgg tggttttttt 6780 
aaaggatctc aagaagatcc tttgatcttt 6840 
aactcacgtt aagggatttt ggtcatgaga 6900 
ttaaattaaa aatgaagttt taaatcaatc 6960 
agttaccaat gcttaatcag tgaggcacct 7020 
atagttgcct gactccccgt cgtgtagata 7 08 0 
cccagtgctg caatgatacc gcgagaccca 7140 
aaccagccag ccggaagggc cgagcgcaga 7200 
cagtctatta attgttgccg ggaagctaga 7260 
aacgttgttg ccattgctac aggcatcgtg 7320 
ttcagctccg gttcccaacg atcaaggcga 7380 
gcggttagct ccttcggtcc tccgatcgtt 74 40 
ctcatggtta tggcagcact gcataattct 7500 
tctgtgactg gtgagtactc aaccaagtca 7560 
tgctcttgcc cggcgtcaat acgggataat 7 620 
ctcatcattg gaaaacgttc ttcggggcga 7 680 



'1 
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aaactctcaa 
aactgatctt 
caaaatgccg 
ctttttcaat 
gaatgtattt 
cctgacgtc 



ggatcttacc 
cagcatcttt 
caaaaaaggg 
attattgaag 
agaaaaataa 



gctgttgaga 
tactttcacc 
aataagggcg 
catttatcag 
acaaataggg 



tccagttcga 
agcgtttctg 
acacggaaat 
ggttattgtc 
gttccgcgca 



tgtaacccac 
ggtgagcaaa 
gttgaatact 
tcatgagcgg 
catttccccg 



tcgtgcaccc 7740 
aacaggaagg 7800 
catactcttc 78 60 
atacatattt 7920 
aaaagtgcca 7 980 
7989 



<210> 48 
<211> 7607 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 



<400> 48 

tctagaagat 

ttctagctac 

gtacaggatg 

agatcgatct 

gcgcgttcgt 

aacttcaggg 

atgttatatg 

tatcaccatg 

ttgtctcctc 

ttgtaacgaa 

tcactttttt 

acaattgtta 

ggaaatattc 

tggttacaat 

ccctctgcta 

gttattgtgc 

actcggtcga 

ggccgccagt 

tagccatttt 

agat cccaac 

ggaagggatt 

ttcccggcag 

tgtaccggag 

ggatgaagag 

gtcttgtcat 

tatgaggacc 

agtggtgggt 

ttttgtattg 

gaaccggagc 

ccgacatcac 

tctaacacac 

gtgagagttg 

gggcaacctt 

cgtgtgtggt 

ataatgttta 

cgtgggctaa 

tctgctgtgc 

ctgtggggct 

tttgaagagc 

gcgcttttcc 

gctgctgttg 

ggggggtacc 

aatcgcctgc 

cagcagcagg 

ggcctggacc 



ccgctgtaca 

tttattagat 

ttctagctac 

cctggccgtt 

cctcactctc 

tgagtttggg 

gagggggcaa 

gaccct catg 

ttattttctt 

tttttaaatt 

ttcaaggcaa 

taattaaatg 

ttattggtag 

gatatacact 

accatgttca 

tgtctcatca 

ctgaaaatga 

cttttggacc 

gaaccaccta 

gaggaggcgg 

gacttactca 

cccgagcagc 

gtgatcgatc 

ggtgaggagt 

tatcaccgga 

tgtggcatgt 

ttggtgtggt 

tgattttttt 

ctgcaagacc 

ctgtgtctag 

ctcctgagat 

gtgggcgtcg 

tggacttgag 

taacgccttt 

acttgcatgg 

tcttggttac 

gtaacttgct 

catcccaggc 

ttttgaaatc 

aagagaaggt 

cttttttgag 

tgctggattt 

tactgttgtc 

aggaagccag 

ctcgggaatg 



ggatgttcta 

ccgctgtaca 

tttattagat 

cggggtcaaa 

ttccgcatcg 

gacccttgat 

agttttcagg 

ataattttgt 

ttcattttct 

cacttttgtt 

tcagggtata 

ataaggtaga 

aaacaactac 

gtttgagatg 

tgccttcttc 

ttttggcaaa 

gacatattat 

agctgatcga 

cccttcacga 

tttcgcagat 

cttttccgcc 

cggagcagag 

ttacctgcca 

ttgtgttaga 

ggaatacggg 

ttgtctacag 

aatttttttt 

aaaaggtcct 

tacccgccgt 

agaatgcaat 

acacccggtg 

ccaggctgtg 

ctgtaaacgc 

gtttgctgaa 

cgtgttaaat 

atctgacctc 

ggaacagagc 

aaagttagtc 

ctgtggtgag 

catcaagact 

ttttataaag 

tctggccatg 

ttccgtccgc 

gcggcggcgg 

aatgttgtac 



gctactttat 
ggatgttcta 
ccgtgtacag 
aaccaggttt 
ctgtctgcga 
tgttctttct 
gtgttgttta 
ttctttcact 
gtaacttttt 
tatttgtcag 
ttatattgta 
atatttctgc 
atcctggtca 
aggataaaat 
tttttcctac 
gaattagatc 
ctgccacgga 
agaggtactg 
actgtatgat 
ttttcccgac 
ggcgcccggt 
agccttgggt 
cgaggctggc 
ttatgtggag 
ggacccagat 
taagtgaaaa 
ttaattttta 
gtgtctgaac 
cctaaaatgg 
agtagtacgg 
gtcccgctgt 
gaatgtatcg 
cccaggccat 
tgagttgatg 
ggggcggggc 
atggaggctt 
tctaacagta 
tgcagaatta 
ctgtttgatt 
ttggattttt 
gataaatgga 
catctgtgga 
ccggcgataa 
caggagcaga 
aggtggctga 



tagatccgct 
gctactttat 
gatgttctag 
ggctataaaa 
gggccaggat 
ttttcgctat 
gaatgggaag 
ttctactctg 
cgttaaactt 
attgtaagta 
cttcagcaca 
atataaattc 
tcatcctgcc 
actctgagtc 
agctcctggg 
taagcttctg 
ggtgttatta 
gctgataatc 
ttagacgtga 
tctgtaatgt 
tctccggagc 
ccggtttcta 
tttccaccca 
caccccgggc 
attatgtgtt 
ttatgggcag 
cagttttgtg 
ctgagcctga 
cgcctgctat 
atagctgtga 
gccccattaa 
aggacttgct 
aaggtgtaaa 
taagtttaat 
ttaaagggta 
gggagtgttt 
cctcttggtt 
aggaggatta 
ctttgaatct 
ccacaccggg 
gcgaagaaac 
gagcggttgt 
taccgacgga 
gcccatggaa 
actgtatcca 



gtacaggatg 
tagatccgct 
ctactttatt 
gggggtgggg 
cgatcctgag 
tgtaaaattc 
atgtcccttg 
ttgacaacca 
tagcttgcat 
ctttctctaa 
gttttagaga 
tggctggcgt 
tttctcttta 
caaaccgggc 
caacgtgctg 
cagctcgagg 
ccgaagaaat 
ttccacctcc 
cggcccccga 
tggcggtgca 
cgcctcacct 
tgccaaacct 
gtgacgacga 
acggttgcag 
cgctttgcta 
tgggtgatag 
gtttaaagaa 
gcccgagcca 
cctgagacgc 
ctccggtcct 
accagttgcc 
taacgagcct 
cctgtgattg 
aaagggtgag 
tataatgcgc 
ggaagatttt 
ttggaggttt 
caagtgggaa 
gggtcaccag 
gcgcgctgcg 
ccatctgagc 
gagacacaag 
ggagcagcag 
cccgagagcc 
gaactgagac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400' 

2460 

2520 

2580 

2640 

2700 
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gcattttgac aattacagag gatgggcagg 
cttgtgaggc tacagaggag gctaggaatc 
ctgagtgtat tacttttcaa cagatcaagg 
cgcagaagta ttccatagag cagctgacca 
aggaggctat tagggtatat gcaaaggtgg 
gcaaacttgt aaatatcagg aattgttgct 
tagatacgga ggatagggtg gcctttagat 
ttggcatgga cggggtggtt attatgaatg 
cggttttcct ggccaatacc aaccttatcc 
atacctgtgt ggaagcctgg accgatgtaa 
ggaagggggt ggtgtgtcgc cccaaaagca 
ggtgtacctt gggtatcctg tctgagggta 
actgtggttg cttcatgcta gtgaaaagcg 
gcaactgcga ggacagggcc tctcagatgc 
tgaagaccat tcacgtagcc agccactctc 
tactgacccg ctgttccttg catttgggta 
gcaatttgag tcacactaag atattgcttg 
acggggtgtt tgacatgacc atgaagatct 
ccaggtgcag accctgcgag tgtggcggta 
atgtgaccga ggagctgagg cccgatcact 
gctctagcga tgaagataca gattgaggta 
gaaagaatat ataaggtggg ggtcttatgt 
ccgccatgag caccaactcg tttgatggaa 
tgcccccatg ggccggggtg cgtcagaatg 
tcctgcccgc aaactctact accttgacct 
ctgcagcctc cgccgccgct tcagccgctg 
ttgctttcct gagcccgctt gcaagcagtg 
agttgacggc tcttttggca caattggatt 
agcagctgtt ggatctgcgc cagcaggttt 
cggtttaaaa cataaataaa aaaccagact 
gctgtctcag ctgactgctt aagtcgcaag 
ttaaagcaga acttgtttat tgcagcttat 
aatttcacaa ataaagcatt tttttcactg 
aatgtatctt atcatgtctg gtcgactcta 
ctgcgctcgg tcgttcggct gcggcgagcg 
ttatccacag aatcagggga taacgcagga 
gccaggaacc gtaaaaaggc cgcgttgctg 
gagcatcaca aaaatcgacg ctcaagtcag 
taccaggcgt ttccccctgg aagctccctc 
accggatacc tgtccgcctt tctcccttcg 
tgtaggtatc tcagttcggt gtaggtcgtt 
cccgttcagc ccgaccgctg cgccttatcc 
agacacgact tatcgccact ggcagcagcc 
gtaggcggtg ctacagagtt cttgaagtgg 
gtatttggta tctgcgctct gctgaagcca 
tgatccggca aacaaaccac cgctggtagc 
acgcgcagaa aaaaaggatc tcaagaagat 
cagtggaacg aaaactcacg ttaagggatt 
acctagatcc ttttaaatta aaaatgaagt 
acttggtctg acagttacca atgcttaatc 
tttcgttcat ccatagttgc ctgactcccc 
ttaccatctg gccccagtgc tgcaatgata 
ttatcagcaa taaaccagcc agccggaagg 
tccgcctcca tccagtctat taattgttgc 
aatagtttgc gcaacgttgt tgccattgct 
ggtatggctt cattcagctc cggttcccaa 
ttgtgcaaaa aagcggttag ctccttcggt 
gcagtgttat cactcatggt tatggcagca 
gtaagatgct tttctgtgac tggtgagtac 
cggcgaccga gttgctcttg cccggcgtca 
actttaaaag tgctcatcat tggaaaacgt 
ccgctgttga gatccagttc gatgtaaccc 



ggctaaaggg ggtaaagagg gagcgggggg 27 60 
tagcttttag cttaatgacc agacaccgtc 2820 
ataattgcgc taatgagctt gatctgctgg 2880 
cttactggct gcagccaggg gatgattttg 2 94 0 
cacttaggcc agattgcaag tacaagatca 3000 
acatttctgg gaacggggcc gaggtggaga 3060 
gtagcatgat aaatatgtgg ccgggggtgc 3120 
taaggtttac tggccccaat tttagcggta 3180 
tacacggtgt aagcttctat gggtttaaca 324 0 
gggttcgggg ctgtgccttt tactgctgct 3300 
gggcttcaat taagaaatgc ctctttgaaa 3360 
actccagggt gcgccacaat gtggcctccg 3420 
tggctgtgat taagcataac atggtatgtg 3480 
tgacctgctc ggacggcaac tgtcacctgc 3540 
gcaaggcctg gccagtgttt gagcataaca 3600 
acaggagggg ggtgttccta ccttaccaat 3660 
agcccgagag catgtccaag gtgaacctga 3720 
ggaaggtgct gaggtacgat gagacccgca 378 0 
aacatattag gaaccagcct gtgatgctgg 3840 
tggtgctggc ctgcacccgc gctgagtttg 3900 
ctgaaatgtg tgggcgtggc ttaagggtgg 3960 
agttttgtat ctgttttgca cjcagccgccg 4020 
gcattgtgag ctcatatttg acaacgcgca 4080 
tgatgggctc cagcattgat ggt cgccccg 414 0 
acgagaccgt gtctggaacg ccgttggaga 4200 
cagccaccgc ccgcgggatt gtgactgact 4 2 60 
cagcttcccg ttcatccgcc cgcgatgaca 4320 
ctttgacccg ggaacttaat gtcgtttctc 4380 
ctgccctgaa ggcttcctcc cctcccaatg 4 440 
ctgtttggat ttggatcaag caagtgtctt 4500 
ccgaattgga tccaattcgg atcgatctta 4560 
aatggttaca aataaagcaa tagcatcaca 4 620 
cattctagtt gtggtttgtc caaactcatc 4 680 
gactcttccg cttcctcgct cactgactcg 4740 
gtatcagctc actcaaaggc ggtaatacgg 4800 
aagaacatgt gagcaaaagg ccagcaaaag 4860 
gcgtttttcc ataggctccg cccccctgac 4 920 
aggtggcgaa acccgacagg actataaaga 4980 
gtgcgctctc ctgttccgac cctgccgctt 5040 
ggaagcgtgg cgctttctca tagctcacgc 5100 
cgctccaagc tgggctgtgt gcacgaaccc 5160 
ggtaactatc gtcttgagtc caacccggta 5220 
actggtaaca ggattagcag agcgaggtat 5280 
tggcctaact acggctacac tagaaggaca 5340 
gttaccttcg gaaaaagagt tggtagctct 5400 
ggtggttttt ttgtttgcaa gcagcagatt 54 60 
cctttgatct tttctacggg gtctgacgct 5520 
ttggtcatga gattatcaaa aaggatcttc 5580 
tttaaatcaa tctaaagtat atatgagtaa 5640 
agtgaggcac ctatctcagc gatctgtcta 5700 
gtcgtgtaga taactacgat acgggagggc 57 60 
ccgcgagacc cacgctcacc ggctccagat 5820 
gccgagcgca gaagtggtcc tgcaacttta 5880 
cgggaagcta gagtaagtag ttcgccagtt 5940 
acaggcatcg tggtgtcacg ctcgtcgttt 6000 
cgatcaaggc gagttacatg atcccccatg 60 60 
cctccgatcg ttgtcagaag taagttggcc 6120 
ctgcataatt ctcttactgt catgccatcc 6180 
tcaaccaagt cattctgaga atagtgtatg 6240 
atacgggata ataccgcgcc acatagcaga 6300 
tcttcggggc gaaaactctc aaggatctta 6360 
actcgtgcac ccaactgatc ttcagcatct 6420 
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tttactttca 
ggaataaggg 
agcatttatc 
aaacaaatag 
attattatca 
gcgtttcggt 
ttgtctgtaa 
cgggtgtcgg 
tatgcggtgt 
agcgttaata 
caataggccg 
agtgttgttc 
gggcgaaaaa 
tttttggggt 
agagcttgac 
gcgggcgcta 
gcgcttaatg 
gaagggcgat 
gcaaggcgat 
gccagtgaat 



ccagcgtttc 
cgacacggaa 
agggttattg 
gggttccgcg 
tgacattaac 
gatgacggtg 
gcggatgccg 
ggctggctta 
gaaataccgc 
ttttgttaaa 
aaatcggcaa 
cagtttggaa 
ccgtctatca 
cgaggtgccg 
ggggaaagcc 
gggcgctggc 
cgccgctaca 
cggtgcgggc 
taagttgggt 
tgtaatacga 



tgggtgagca 
atgttgaata 
tctcatgagc 
cacatttccc 
ctataaaaat 
aaaacctctg 
ggagcagaca 
actatgcggc 
acagatgcgt 
attcgcgtta 
aatcccttat 
caagagtcca 
gggcgatggc 
taaagcacta 
ggcgaacgtg 
aagtgtagcg 
gggcgcgtcc 
ctcttcgcta 
aacgccaggg 
ctcactatag 



aaaacaggaa 
ctcatactct 
ggatacatat 
cgaaaagtgc 
aggcgtatca 
acacatgcag 
agcccgtcag 
atcagagcag 
aaggagaaaa 
aatttttgtt 
aaatcaaaag 
ctattaaaga 
ccactacgtg 
aatcggaacc 
gcgagaaagg 
gtcacgctgc 
cattcgccat 
ttacgccagc 
ttttcccagt 
ggcgaattaa 



ggcaaaatgc 
tcctttttca 
ttgaatgtat 
cacctgacgt 
cgaggcccct 
ctcccggaga 
ggcgcgtcag 
attgtactga 
taccgcatca 
aaatcagctc 
aatagaccga 
acgtggactc 
aaccatcacc 
ctaaagggag 
aagggaagaa 
gcgtaaccac 
tcaggctgcg 
tggcgaaagg 
cacgacgttg 
ttcgggg 



cgcaaaaaag 
atattattga 
ttagaaaaat 
ctaagaaacc 
ttcgtctcgc 
cggtcacagc 
cgggtgttgg 
gagtgcacca 
ggaaattgta 
attttttaac 
gatagggttg 
caacgtcaaa 
ctaatcaagt 
cccccgattt 
agcgaaagga 
cacacccgcc 
caactgttgg 
gggatgtgct 
taaaacgacg 



6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7607 



<210> 49 
<211> 11600 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 49 

gaattccgca ttgcagagat attgtattta agtgcctagc tcgatacaat aaacgccatt 60 

tgaccattca ccacattggt gtgcacctcc aagcttgggc agaaatggtt gaactcccga 120 

gagtgtccta cacctagggg agaagcagcc aaggggttgt ttcccaccaa ggacgacccg 180 

tctgcgcaca aacggatgag cccatcagac aaagacatat tcattctctg ctgcaaactt 240 

ggcatagctc tgctttgcct ggggctattg ggggaagttg cggttcgtgc tcgcagggct 300 

ctcacccttg actcttttaa tagctcttct gtgcaagatt acaatctaaa caattcggag 360 

aactcgacct tcctcctgag gcaaggacca cagccaactt cctcttacaa gccgcatcga 420 

ttttgtcctt cagaaataga aataagaatg cttgctaaaa attatatttt taccaataag 4 80 

accaatccaa taggtagatt attagttact atgttaagaa atgaatcatt atcttttagt 540 

actattttta ctcaaattca gaagttagaa atgggaatag aaaatagaaa gagacgctca 600 

acctcaattg aagaacaggt gcaaggacta ttgaccacag gcctagaagt aaaaaaggga 660 

aaaaagagtg tttttgtcaa aataggagac aggtggtggc aaccagggac ttatagggga 720 

ccttacatct acagaccaac agatgccccc ttaccatata caggaagata tgacttaaat 780 

tgggataggt gggttacagt caatggctat aaagtgttat atagatccct cccttttcgt 840 

gaaagactcg ccagagctag acctccttgg tgtatgttgt ctcaagaaga aaaagacgac 900 

atgaaacaac aggtacatga ttatatttat ctaggaacag gaatgcactt ttggggaaag 960 

attttccata ccaaggaggg gacagtggct ggactaatag aacattattc tgcaaaaact 1020 

catggcatga gttattatga atagccttta ttggcccaac cttgcggttc ccagggctta 1080 

agtaagtttt tggttacaaa ctgttcttaa aacgaggatg tgagacaagt ggtttcctga 1140 

cttggtttgg tatcaaaggt tctgatctga gctctgagtg ttctattttc ctatgttctt 1200 

ttggaattta tccaaatctt atgtaaatgc ttatgtaaac caagatataa aagagtgctg 1260 

attttttgag taaacttgca acagtcctaa cattcacctc ttgtgtgttt gtgtctgttc 1320 

gccatcccgt ctccgctcgt cacttatcct tcactttcca gagggtcccc ccgcagaccc 1380 

cggcgaccct caggtcggcc gactgcggca gctggcgccc gaacagggac cctcggataa 14 40 

gtgacccttg tctctatttc tactatttgg tgtttgtctt gtattgtctc tttcttgtct 1500 

ggctatcatc acaagagcgg aacggactca ccatagggac caagctagcg cttctcgtcg 1560 

cgtccaagac cctcaaagat ttttggcact tcgttgagcg aggcgatatc aggtatgaca 1620 

gcgccctgcc gcaaggccag ctgcttgtcc gctcggctgc ggttggcacg gcaggatagg 1680 

ggtatcttgc agttttggaa aaagatgtga taggtggcaa gcacctctgg cacggcaaat 1740 

acggggtaga agttgaggcg cgggttgggc tcgcatgtgc cgttttcttg gcgtttgggg 1800 

ggtacgcgcg gtgagaatag gtggcgttcg taggcaaggc tgacatccgc tatggcgagg 18 60 



i* 



ggcacatcgc tgcgctcttg caacgcgtcg 
ttcaacagca cgtcgtctcc cacatctagg 
acttgttcct cgtttgcctc tgcgttgtcc 
gagcggtcct cgtcgtcttc gcttacaaaa 
tcctcaagcg ggggtgcctc gacggggaag 
gcggtggtgg cgaactcaga gggggcggtt 
atctttttct gcctatagga gaaggaaatg 
acccccgagc gcggacgcgg tgcggcgcga 
ccgtccccgt cgccgccgcc tccccgggcg 
gagtccgagg acgaggaaga ctcatcacaa 
cggccatcga cctcggcggc ggatttggcc 
tctcccaagc ccgagcgccc gccatcacca 
gaagatgtgg cgctacaaat ggtgggtttc 
aaaggaggta agcgcacagt gcggcggctg 
cggacgcaag aggaagagga agagcccagc 
ccgctgagtg tgccgatcgt gtctgcgtgg 
atggacaagt accacgtgga taacgatcta 
gtggaagctc tggcggccgt atgcaagacc 
ctgaccttca ccagcaacaa gacctttgtg 
ctgcagtcgt ttgcagaggt gacctacaag 
ctgcaccgct gcgctgagat cgaaggcgag 
aataaggagc acgtgattga aatggatgtg 
gagcagtcta gcaaggccaa gatcgtgaag 
tccaacaccg acgcaaggtg ctgcgtgcac 
ggcaagtctt gcggcatgtt cttctctgaa 
atcaaggctt ttatgcaggc gctgtatcct 
atgccactac ggtgcgagtg caactcaaag 
ctaccaaagt tgactccgtt cgccctgagc 
tccgacaaga gcgtgctggc cagcgtgcac 
aaccctgtgt atcgcaactc gcgcgcgcag 
tcggcgcccg acctgctaaa cgcgttggtg 
accgagctgc cgcggatggt tgtgcctgag 
aacgtgtccc tgccagtggc gcatagcgat 
gcgcagacgg caagggtggg ggtaaataat 
cctttattga aagtgtctct agtacattat 
gtggcgctcc taatctgcgc actgtggctg 
ctgtagagct gttcctggtt gcgacgcagg 
gagttgggta ccccggtaat aaggttcatg 
cagttggcaa aggcgtggag aaacatgcag 
ccctgtacgc tttgggtgga cttttccagc 
gcgctacggc gcaggagtga ctcgtactca 
gaaaagccaa agggctcaaa gaggtagcat 
atccagtgta cgcccccagt ctcgcgaccg 
tgtggagaaa caaagcctgg aaagcgcttg 
ccaagatctt tgacaatggc tttcagttcc 
gttgatgttg cttgcttctt tatgttgtgg 
acacggtttc gatgacgccg cggtgcggcc 
acaaaacata aagaagggtg ggctcgtcca 
ttacctcagg tcgacctcga gggatctttg 
aattggacaa actacctaca gagatttaaa 
tataatgtgt taaactactg attctaattg 
ctgatgaatg ggagcagtgg tggaatgcct 
aaatgccatc tagtgatgat gaggctactg 
agaagagaaa ggtagaagac cccaaggact 
atgctgtgtt tagtaataga actcttgctt 
ctgcactgct atacaagaaa attatggaaa 
acagttataa tcataacata ctgttttttc 
ttaataacta tgctcaaaaa ttgtgtacct 
aggaatattt gatgtatagt gccttgacta 
gaggttttac ttgctttaaa aaacctccca 
aatgcaattg ttgttgttaa cttgtttatt 
agcatcacaa atttcacaaa taaagcattt 
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cagataatgg cgcactggcg ctgcagatgc 1920 
tagtcgccat gcctttcgtc cccccgcccg 1980 
tggtcttgct ttttatcctc tgttggtact 2040 
cctgggtcct gctcgataat cacttcctcc 2100 
gtggtaggcg cgttggcggc atcggtggag 2160 
aggctgtcct tcttctcgac tgactccatg 2220 
gccagtcggg aagaggagca gcgcgaaacc 2280 
cgtcccccaa ccatggagga cgtgtcgtcc 2340 
cccccaaaaa agcggatgag gcggcgtatc 2400 
gacgcgctgg tgccgcgcac acccagcccg 24 60 
attgcgccca agaagaaaaa gaagcgccct 2520 
gaggtaatcg tggacagcga ggaagaaaga 2580 
agcaacccac cggtgctaat caagcatggc 264 0 
aatgaagacg acccagtggc gcgtggtatg 2700 
gaagcggaaa gtgaaattac ggtgatgaac 27 60 
gagaagggca tggaggctgc gcgcgcgctg 2820 
aaggcgaact tcaaactact gcctgaccaa 2880 
tggctgaacg aggagcaccg cgggttgcag 294 0 
acgatgatgg ggcgattcct gcaggcgtac 3000 
catcacgagc ccacgggctg cgcgttgtgg 3060 
cttaagtgtc tacacggaag cattatgata 3120 
acgagcgaaa acgggcagcg cgcgctgaag 3180 
aaccggtggg gccgaaatgt ggtgcagatc 324 0 
gacgcggcct gtccggccaa tcagttttcc 3300 
ggcgcaaagg ctcaggtggc ttttaagcag 3360 
aacgcccaga ccgggcacgg tcaccttttg 3420 
cctgggcacg cgcccttttt gggaaggcag 3480 
aacgcggagg acctggacgc ggatctgatc 354 0 
cacccggcgc tgatagtgtt ccagtgctgc 3600 
ggcggaggcc ccaactgcga cttcaagata 3660 
atggtgcgca gcctgtggag tgaaaacttc 3720 
tttaagtgga gcactaaaca ccagtatcgc 3780 
gcgcggcaga acccctttga tttttaaacg 3840 
cacccgagag tgtacaaata aaagcatttg 3900 
ttttacatgt ttttcaagtg acaaaaagaa 3960 
cggaagtagg gcgagtggcg ctccaggaag 4020 
gtgggctgta cctggggact gttgagcatg 4080 
gtggggttgt gatccatggg agtttggggc 414 0 
cagaatagtc cacaggcggc cgagttgggc 4 200 
gttatacagc ggtcggggga agaagcaatg 4260 
aactggtaaa cctgcttgag tcgctggtca 4320 
gtttttgagt gcgggttcca ggcaaaggcc 4380 
gccgtattga ctatggcgca ggcgagcttg 4 44 0 
tcataggtgc ccaaaaaata tggcccacaa 4500 
tgctcactgg agcccatggc ggcagctgtt 4 560 
cgttgccggc cgagaagggc gtgcgcaggt 4 620 
ggtgcacacg gaccacgtca aagacttcaa 4 680 
tgggatccat atatagggcc cgggttataa 4740 
tgaaggaacc ttacttctgt ggtgtgacat 4 800 
gctctaaggt aaatataaaa tttttaagtg 4860 
tttgtgtatt ttagattcca acctatggaa 4920 
ttaatgagga aaacctgttt tgctcagaag 4 980 
ctgactctca acattctact cctccaaaaa 5040 
ttccttcaga attgctaagt tttttgagtc 5100 
gctttgctat ttacaccaca aaggaaaaag 5160 
aatattctgt aacctttata agtaggcata 5220 _ 
ttactccaca caggcataga gtgtctgcta 5280 
ttagcttttt aatttgtaaa ggggttaata 5340 
gagatcataa tcagccatac cacatttgta 5400 
cacctccccc tgaacctgaa acataaaatg 54 60 
gcagcttata atggttacaa ataaagcaat 5520 
ttttcactgc attctagttg tggtttgtcc 5580 
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aaactcatca atgtatctta tcatgtctgg atccggctgt ggaatgtgtg tcagttaggg 5 64 0 

tgtggaaagt ccccaggctc cccagcaggc agaagtatgc aaagcatgca tctcaattag 57 00 

tcagcaacca ggtgtggaaa gtccccaggc tccccagcag gcagaagtat gcaaagcatg 57 60 

catctcaatt agtcagcaac catagtcccg cccctaactc cgcccatccc gcccctaact 5820 

ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat ttatgcagag 5880 

gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt ttttggaggc 594 0 

ctaggctttt gcaaaaagct tcacgctgcc gcaagcactc agggcgcaag ggctgctaaa 6000 

ggaagcggaa cacgtagaaa gccagtccgc agaaacggtg ctgaccccgg atgaatgtca 6060 

gctactgggc tatctggaca agggaaaacg caagcgcaaa gagaaagcag gtagcttgca 6120 

gtgggcttac atggcgatag ctagactggg cggttttatg gacagcaagc gaaccggaat 6180 

tgccagctgg ggcgccctct ggtaaggttg ggaagccctg caaagtaaac tggatggctt 624 0 

tcttgccgcc aaggatctga tggcgcaggg gatcaagatc tgatcaagag acaggatgag 6300 

gatcgtttcg catgattgaa caagatggat tgcacgcagg ttctccggcc gcttgggtgg 6360 

agaggctatt cggctatgac tgggcacaac agacaatcgg ctgctctgat gccgccgtgt 64 20 

tccggctgtc agcgcagggg cgcccggttc tttttgtcaa gaccgacctg tccggtgccc 6480 

tgaatgaact gcaggacgag gcagcgcggc tatcgtggct ggccacgacg ggcgttcctt 654 0 

gcgcagctgt gctcgacgtt gtcactgaag cgggaaggga ctggctgcta ttgggcgaag 6600 

tgccggggca ggatctcctg tcatctcacc ttgctcctgc cgagaaagta tccatcatgg 6660 

ctgatgcaat gcggcggctg catacgcttg atccggctac ctgcccattc gaccaccaag 6720 

cgaaacatcg catcgagcga gcacgtactc ggatggaagc cggtcttgtc gatcaggatg 678 0 

atctggacga agagcatcag gggctcgcgc cagccgaact gttcgccagg ctcaaggcgc 68 4 0 

gcatgcccga cggcgaggat ctcgtcgtga cccatggcga tgcctgcttg ccgaatatca 6900 

tggtggaaaa tggccgcttt tctggattca tcgactgtgg ccggctgggt ^tggcggacc 6960 

gctatcagga catagcgttg gctacccgtg atattgctga agagcttggc ggcgaatggg 7020 

ctgaccgctt cctcgtgctt tacggtatcg ccgctcccga ttcgcagcgc atcgccttct 7 08 0 

atcgccttct tgacgagttc ttctgagcgg gactctgggg ttcgaaatga ccgaccaagc 714 0 

gacgcccaac ctgccatcac gagatttcga ttccaccgcc gccttctatg aaaggttggg 7200 

cttcggaatc gttttccggg acgccggctg gatgatcctc cagcgcgggg atctcatgct 72 60 

ggagttcttc gcccaccccg ggctcgatcc cctcgcgagt tggttcagct gctgcctgag 7320 

gctggacgac ctcgcggagt tctaccggca gtgcaaatcc gtcggcatcc aggaaaccag 7380 

cagcggctat ccgcgcatcc atgcccccga actgcaggag tggggaggca cgatggccgc 74 40 

tttggtcccg gatctttgtg aaggaacctt acttctgtgg tgtgacataa ttggacaaac 7500 

tacctacaga gatttaaagc tctaaggtaa atataaaatt tttaagtgta taatgtgtta 7 560 

aactactgat tctaattgtt tgtgtatttt agattccaac ctatggaact gatgaatggg 7 620 

agcagtggtg gaatgccttt aatgaggaaa acctgttttg ctcagaagaa atgccatcta 7 68 0 

gtgatgatga ggctactgct gactctcaac attctactcc tccaaaaaag aagagaaagg 774 0 

tagaagaccc caaggacttt ccttcagaat tgctaagttt tttgagtcat gctgtgttta 7800 

gtaatagaac tcttgcttgc tttgctattt acaccacaaa ggaaaaagct gcactgctat 7860 

acaagaaaat tatggaaaaa tattctgtaa cctttataag taggcataac agttataatc 7920 

ataacatact gttttttctt actccacaca ggcatagagt gtctgctatt aataactatg 7980 

ctcaaaaatt gtgtaccttt agctttttaa tttgtaaagg ggttaataag gaatatttga 8040 

tgtatagtgc cttgactaga gatcataatc agccatacca catttgtaga ggttttactt 8100 

gctttaaaaa acctcccaca cctccccctg aacctgaaac ataaaatgaa tgcaattgtt 8160 

gttgttaact tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat 8220 

ttcacaaata aagcattttt ttcactgcat tctagttgtg gtttgtccaa actcatcaat 8280 

gtatcttatc atgtctggat ccccaggaag ctcctctgtg tcctcataaa ccctaacctc 8340 

ctctacttga gaggacattc caatcatagg ctgcccatcc accctctgtg tcctcctgtt 8400 

aattaggtca cttaacaaaa aggaaattgg gtaggggttt ttcacagacc gctttctaag 8460 

ggtaatttta aaatatctgg gaagtccctt ccactgctgt gttccagaag tgttggtaaa 8520 

cagcccacaa atgtcaacag cagaaacata caagctgtca gctttgcaca agggcccaac 8580 

accctgctca tcaagaagca ctgtggttgc tgtgttagta atgtgcaaaa caggaggcac 8640 

attttcccca cctgtgtagg ttccaaaata tctagtgttt tcatttttac ttggatcagg 8700 

aacccagcac tccactggat aagcattatc cttatccaaa acagccttgt ggtcagtgtt 87 60 

catctgctga ctgtcaactg tagcattttt tggggttaca gtttgagcag gatatttggt 8820 

cctgtagttt gctaacacac cctgcagctc caaaggttcc ccaccaacag caaaaaaatg 8880 

aaaatttgac ccttgaatgg gttttccagc accattttca tgagtttttt gtgtccctga 8940 

atgcaagttt aacatagcag ttaccccaat aacctcagtt ttaacagtaa cagcttccca 9000 

catcaaaata tttccacagg ttaagtcctc atttaaatta ggcaaaggaa ttcttgaaga 9060 

cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct 9120 

tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc 9180 

taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa 9240 

tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt 9300 



-47- 



gcggcatttt 
gaagatcagt 
cttgagagtt 
tgtggcgcgg 
tattctcaga 
atgacagtaa 
ttacttctga 
gatcatgtaa 
gagcgtgaca 
gaactactta 
gcaggaccac 
gccggtgagc 
cgtatcgtag 
atcgctgaga 
tatatacttt 
ctttttgata 
gaccccgtag 
tgcttgcaaa 
ccaactcttt 
ctagtgtagc 
gctctgctaa 
ttggactcaa 
tgcacacagc 
ctatgagaaa 
agggtcggaa 
agtcctgtcg 
gggcggagcc 
tggccttttg 
accgcctttg 
gtgagcgagg 
atttcacacc 
cagtatctgc 
gctacaacaa 
tttgcgctgc 
tttaggcgaa 
tttcgctttt 
acatggtaac 
gattggtgga 
catggattgg 



gccttcctgt 
tgggtgcacg 
ttcgccccga 
tattatcccg 
atgacttggt 
gagaattatg 
caacgatcgg 
ctcgccttga 
ccacgatgcc 
ctctagcttc 
ttctgcgctc 
gtgggtctcg 
ttatctacac 
taggtgcctc 
agattgattt 
atctcatgac 
aaaagatcaa 
caaaaaaacc 
ttccgaaggt 
cgtagttagg 
tcctgttacc 
gacgatagtt 
ccagcttgga 
gcgccacgct 
caggagagcg 
ggtttcgcca 
tatggaaaaa 
ctcacatgtt 
agtgagctga 
aagcggaaga 
gcatatggtg 
tccctgcttg 
ggcaaggctt 
ttcgcgatgt 
aagcggggct 
gcatagggag 
gatgagttag 
agtaaggtgg 
acgaaccact 



ttttgctcac 
agtgggttac 
agaacgtttt 
tgttgacgcc 
tgagtactca 
cagtgctgcc 
aggaccgaag 
tcgttgggaa 
tgcagcaatg 
ccggcaacaa 
ggcccttccg 
cggtatcatt 
gacggggagt 
actgattaag 
aaaacttcat 
caaaatccct 
aggatcttct 
accgctacca 
aactggcttc 
ccaccacttc 
agtggctgct 
accggataag 
gcgaacgacc 
tcccgaaggg 
cacgagggag 
cctctgactt 
cgccagcaac 
ctttcctgcg 
taccgctcgc 
gcgcctgatg 
cactctcagt 
tgtgttggag 
gaccgacaat 
acgggccaga 
tcggttgtac 
ggggaaatgt 
caacatgcct 
tacgatcgtg 



ccagaaacgc 

atcgaactgg 

ccaatgatga 

gggcaagagc 

ccagtcacag 

ataaccatga 

gagctaaccg 

ccggagctga 

gcaacaacgt 

ttaatagact 

gctggctggt 

gcagcactgg 

caggcaacta 

cattggtaac 

ttttaattta 

taacgtgagt 

tgagatcctt 

gcggtggttt 

agcagagcgc 

aagaactctg 

gccagtggcg 

gcgcagcggt 

tacaccgaac 

agaaaggcgg 

cttccagggg 

gagcgtcgat 

gcggcctttt 

ttatcccctg 

cgcagccgaa 

cggtattttc 

acaatctgct 

gtcgctgagt 

tgcatgaaga 

tatacgcgta 

gcggttagga 

agtcttatgc 

tacaaggaga 

ccttattagg 



tggtgaaagt 

atctcaacag 

gcacttttaa 

aactcggtcg 

aaaagcatct 

gtgataacac 

cttttttgca 

atgaagccat 

tgcgcaaact 

ggatggaggc 

ttattgctga 

ggccagatgg 

tggatgaacg 

tgtcagacca 

aaaggatcta 

tttcgttcca 

tttttctgcg 

gtttgccgga 

agataccaaa 

tagcaccgcc 

ataagtcgtg 

cgggctgaac 

tgagatacct 

acaggtatcc 

gaaacgcctg 

ttttgtgatg 

tacggttcct 

attctgtgga 

cgaccgagcg 

tccttacgca 

ctgatgccgc 

agtgcgcgag 

atctgcttag 

tctgagggga 

gtcccctcag 

aatacacttg 

gaaaaagcac 

aaggcaacag 



aaaagatgct 
cggtaagatc 
agttctgcta 
ccgcatacac 
tacggatggc 
tgcggccaac 
caacatgggg 
accaaacgac 
attaactggc 
ggataaagtt 
taaatctgga 
taagccctcc 
aaatagacag 
agtttactca 
ggtgaagatc 
ctgagcgt ca 
cgtaatctgc 
tcaagagcta 
tactgtcctt 
tacatacctc 
tcttaccggg 

pgggggttcg 

acagcgtgag 
ggtaagcggc 
gtatctttat 
ctcgtcaggg 
ggccttttgc 
taaccgtatt 
cagcgagt ca 
tctgtgcggt 
atagttaagc 
caaaatttaa 
ggttaggcgt 
ctagggtgtg 
gatatagtag 
tagtcttgca 
cgtgcatgcc 
acgggtctga 



9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11600 



<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 49 



<210> 50 
<211> 8238 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 50 

gcggccgcca tcatcaataa tataccttat tttggattga agccaatatg ataatgaggg 60 
ggtggagttt gtgacgtggc gcggggcgtg ggaacggggc gggtgacgta gtagtgtggc 120 
ggaagtgtga tgttgcaagt gtggcggaac acatgtaagc gacggatgtg gcaaaagtga 180 
cgtttttggt gtgcgccggt gtacacagga agtgacaatt ttcgcgcggt tttaggcgga 240 



s 
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tgttgtagta 
taagaggaag 
gccgcgggga 
ccagatatac 
tgtacgcggt 
aatgtagtct 
tgccttacaa 
tcgtgcctta 
ccgcattgca 
attcaccaca 
gcgggggcca 
cttggatcgg 
tccgcatcga 
tagaactagt 
gcacctcttt 
cgcgtctgcg 
gcgccctcgc 
actgcgggtt 
ggtggccctg 
ggtgctgggg 
gggtgagata 
catgccttat 
ttcacatgcc 
cctgtgctac 
gttcgtggcc 
ggaggacaga 
ggctatgctg 
gcagggcggc 
ccagggtgcc 
taccctgttt 
ctgggccttg 
cgaccaatcg 
gacccacgtc 
ccgggagatg 
ataccgtcga 
aaatttcaca 
caatgtatct 
gacccgcacc 
gatgctggat 
tgagtttggc 
aagggtggga 
agccgccgcc 
aacgcgcatg 
tcgccccgtc 
gttggagact 
gactgacttt 
cgatgacaag 
cgtttctcag 
tcccaatgcg 
agtgtcttgc 
tcggtcgttg 
cagatacatg 
ctgcggggtg 
aatgtctttc 
gcggttaagc 
taggttggct 
cacagtgtat 
gaacttggag 
aatgggccca 
gtgttccagg 
ctgcggtata 
ccacgctttg 



aatttgggcg 
tgaaatctga 
ctttgaccgt 
gcgtatctga 
taggagtccc 
tatgcaatac 
ggagagaaaa 
ttaggaaggc 
gagatattgt 
ttggtgtgca 
gctgttgggc 
aaacccgtcg 
ccggatcgga 
ggatcccccg 
ggcaagcgcc 
ttcgaccagg 
cggcagcaag 
tatatagacg 
ggttcgcgcg 
gcttccgaga 
tcggccgggg 
gccgtgaccg 
ccgcccccgg 
ccggccgcgc 
ctcatcccgc 
cacatcgacc 
gccgcgatt c 
gggtcgtggt 
gagccccaga 
cgggcccccg 
gacgtcttgg 
cccgccggct 
accaccccag 
ggggaggcta 
aacttgttta 
aataaagcat 
tatcatgtct 
aggtgcagac 
gtgaccgagg 
tctagcgatg 
aagaatatat 
gccatgagca 
cccccatggg 
ctgcccgcaa 
gcagcctccg 
gctttcctga 
ttgacggctc 
cagctgttgg 
gtttaaaaca 
tgtctttatt 
agggtcctgt 
ggcataagcc 
gtgttgtaga 
agtagcaagc 
tgggatgggt 
atgttcccag 
ccggtgcact 
acgcccttgt 
cgggcggcgg 
atgagatcgt 
atggttccat 
agttcagatg 



taaccgagta 
ataattttgt 
ttacgtggag 
ggggactagg 
ctcaggatat 
tcttgtagtc 
agcaccgtgc 
aacagacggg 
atttaagtgc 
cctccggccc 
tcgcggttga 
gcctccgaac 
aaacctctcg 
ggctgcagga 
ttgtagaagc 
ctgcgcgttc 
aagccacgga 
gtcctcacgg 
acgatatcgt 
caatcgcgaa 
acgcggcggt 
acgccgttct 
ccctcaccct 
gataccttat 
cgaccttgcc 
gcctggccaa 
gccgcgttta 
gggaggattg 
gcaacgcggg 
agttgctggc 
ccaaacgcct 
gccgggacgc 
gctccatacc 
actgactcga 
ttgcagctta 
ttttttcact 
ggatccgacc 
cctgcgagtg 
agctgaggcc 
aagatacaga 
aaggtggggg 
ccaactcgtt 
ccggggtgcg 
actctactac 
ccgccgcttc 
gcccgcttgc 
ttttggcaca 
atctgcgcca 
taaataaaaa 
taggggtttt 
gtattttttc 
cgtctctggg 
tgatccagtc 
tgattgccag 
gcatacgtgg 
ccatatccct 
tgggaaattt 
gacctccaag 
cctgggcgaa 
cataggccat 
ccggcccagg 
gggggatcat 



agatttggcc 
gttactcata 
actcgcccag 
gtgtgtttag 
agtagtttcg 
ttgcaacatg 
atgccgattg 
tctgacatgg 
ctagctcgat 
atatggccac 
ggacaaactc 
ggtactccgc 
agaaaggcgt 
attcgatgat 
gcgtatggct 
tcgcggccat 
agtccgcctg 
gatggggaaa 
ctacgtaccc 
catctacacc 
ggtaatgaca 
ggctcctcat 
catcttcgac 
gggcagcatg 
cggcacaaac 
acgccagcgc 
cgggctgctt 
gggacagctt 
cccacgaccc 
ccccaacggc 
ccgtcccatg 
cctgctgcaa 
gacgatctgc 
gaagcttggg 
taatggttac 
gcattctagt 
tcggatctgg 
tggcggtaaa 
cgatcacttg 
ttgaggtact 
tcttatgtag 
tgatggaagc 
tcagaatgtg 
cttgacctac 
agccgctgca 
aagcagtgca 
attggattct 
gcaggtttct 
accagactct 
gcgcgcgcgg 
caggacgtgg 
gtggaggtag 
gtagcaggag 
gggcaggccc 
ggatatgaga 
ccggggattc 
gtcatgtagc 
attttccatg 
gatatttctg 
ttttacaaag 
ggcgtagtta 
gtctacctgc 



attttcgcgg 
gcgcgtaata 
ggcgcgcccc 
gcgaaaagcg 
cttttgcata 
gtaacgatga 
gtggaagtaa 
attggacgaa 
acaataaacg 
tctcttccgc 
ttcgcggtct 
cgccgaggga 
gtaaccagtc 
cttggtggcg 
tcgtacccct 
agcaaccgac 
gagcagaaaa 
accaccacca 
gagccgatga 
acacaacacc 
agcgcccaga 
gtcggggggg 
cgccatccca 
accccccagg 
atcgtgttgg 
cccggcgagc 
gccaatacgg 
tcggggacgg 
catatcgggg 
gacctgtata 
cacgtcttta 
cttacctccg 
gacctggcgc 
cccatcgatc 
aaataaagca 
tgtggtttgt 
aaggtgctga 
catattagga 
gtgctggcct 
gaaatgtgtg 
ttttgtatct 
attgtgagct 
atgggctcca 
gagaccgtgt 
gccaccgccc 
gcttcccgtt 
ttgacccggg 
gccctgaagg 
gtttggattt 
taggcccggg 
taaaggtgac 
caccactgca 
cgctgggcgt 
ttggtgtaag 
tgcatcttgg 
atgttgtgca 
ttagaaggaa 
cattcgtcca 
ggatcactaa 
cgcgggcgga 
ccctcacaga 
ggggcgatga 



gaaaactgaa 

tttgtctagg 

gatgtacggg 

gggcttcggt 

gggaggggga 

gttagcaaca 

ggtggtacga 

ccactgaatt 

ccatttgacc 

atcgctgtct 

ttccagtact 

cctgagcgag 

acagtcgctc 

tgaaactccc 

gccatcaaca 

gtacggcgtt 

tgcccacgct 

cgcaactgct 

cttactggca 

gcctcgacca 

taacaatggg 

aggctgggag 

tcgccgccct 

ccgtgctggc 

gggcccttcc 

ggcttgacct 

tgcggtatct 

ccgtgccgcc 

acacgttatt 

acgtgtttgc 

tcctggatta 

ggatggtcca 

gcacgtttgc 

aagcttatcg 

atagcatcac 

ccaaactcat 

ggtacgatga 

accagcctgt 

gcacccgcgc 

ggcgtggctt 

gttttgcagc 

catatttgac 

gcattgatgg 

ctggaacgcc 

gcgggattgt 

catccgcccg 

aacttaatgt 

cttcctcccc 

ggatcaagca 

accagcggtc 

tctggatgtt 

gagcttcatg 

ggtgcctaaa 

tgtttacaaa 

actgtatttt 

gaaccaccag 

atgcgtggaa 

taatgatggc 

cgtcatagtt 

gggtgccaga 

tttgcatttc 

agaaaacggt 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660" 

3720 

3780 

3840 

3900 

3960 



ttccggggta ggggagatca gctgggaaga 
gcagccggtg ggcccgtaaa tcacacctat 
gcagctgccg tcatccctga gcaggggggc 
gttttccctg accaaatccg ccagaaggcg 
ggaagcaaag tttttcaacg gtttgagacc 
accaagcagt tccaggcggt cccacagctc 
catatctcct cgtttcgcgg gttggggcgg 
tccagacggg ccagggtcat gtctttccac 
gtcacggtga aggggtgcgc tccgggctgc 
ctgctggtgc tgaagcgctg ccggtcttcg 
atggtgtcat agtccagccc ctccgcggcg 
gaggcgccgc acgaggggca gtgcagactt 
accgattccg gggagtaggc atccgcgccg 
agccaggtga gctctggccg ttcggggtca 
cgtttcttac ctctggtttc catgagccgg 
gtgtccccgt atacagactt gagaggcctg 
tatagaaact cggaccactc tgagacaaag 
aagtgggagg ggtagcggtc gttgtccact 
cacatgtcgc cctcttcggc atcaaggaag 
ccgggtgttc ctgaaggggg gctataaaag 
tccgcatcgc tgtctgcgag ggccagctgt 
atgacttctg cgctaagatt gtcagtttcc 
cccgcggtga tgcctttgag ggtggccgca 
ttgtcaagct tcgagggggg gcccggtacc 
tgcgcgcttg gcgtaatcat ggtcatagct 
aattccacac aacatacgag ccggaagcat 
gagctaactc acattaattg cgttgcgctc 
gtgccagctg cattaatgaa tcggccaacg 
ctcttccgct tcctcgctca ctgactcgct 
atcagctcac tcaaaggcgg taatacggtt 
gaacatgtga gcaaaaggcc agcaaaaggc 
gtttttccat aggctccgcc cccctgacga 
gtggcgaaac ccgacaggac tataaagata 
gcgctctcct gttccgaccc tgccgcttac 
aagcgtggcg ctttctcata gctcacgctg 
ctccaagctg ggctgtgtgc acgaaccccc 
taactatcgt cttgagtcca acccggtaag 
tggtaacagg attagcagag cgaggtatgt 
gcctaactac ggctacacta gaaggacagt 
taccttcgga aaaagagttg gtagctcttg 
tggttttttt gtttgcaagc agcagattac 
tttgatcttt tctacggggt ctgacgctca 
ggtcatgaga ttatcaaaaa ggatcttcac 
taaatcaatc taaagtatat atgagtaaac 
tgaggcacct atctcagcga tctgtctatt 
cgtgtagata actacgatac gggagggctt 
gcgagaccca cgctcaccgg ctccagattt 
cgagcgcaga agtggtcctg caactttatc 
ggaagctaga gtaagtagtt cgccagttaa 
aggcatcgtg gtgtcacgct cgtcgtttgg 
atcaaggcga gttacatgat cccccatgtt 
tccgatcgtt gtcagaagta agttggccgc 
gcataattct cttactgtca tgccatccgt 
aaccaagtca ttctgagaat agtgtatgcg 
acgggataat accgcgccac atagcagaac 
ttcggggcga aaactctcaa ggatcttacc 
tcgtgcaccc aactgatctt cagcatcttt 
aacaggaagg caaaatgccg caaaaaaggg 
catactcttc ctttttcaat attattgaag 
atacatattt gaatgtattt agaaaaataa 
aaaagtgcca cctgacgcgc cctgtagcgg 
gcgcagcgtg accgctacac ttgccagcgc 
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aagcaggttc ctgagcagct gcgacttacc 4 020 
taccggctgc aactggtagt taagagagct 4080 
cacttcgtta agcatgtccc tgactcgcat 414 0 
ctcgccgccc agcgatagca gttcttgcaa 4200 
gtccgccgta ggcatgcttt tgagcgtttg 4260 
ggtcacctgc tctacggcat ctcgatccag 4320 
ctttcgctgt acggcagtag tcggtgctcg 4380 
gggcgcaggg tcctcgtcag cgtagtctgg 4440 
gcgctggcca gggtgcgctt gaggctggtc 4500 
ccctgcgcgt cggccaggta gcatttgacc 4560 
tggcccttgg cgcgcagctt gcccttggag 4 620 
ttgagggcgt agagcttggg cgcgagaaat 4 680 
caggccccgc agacggtctc gcattccacg 4740 
aaaaccaggt ttcccccatg ctttttgatg 4800 
tgtccacgct cggtgacgaa aaggctgtcc 4 8 60 
tcctcgagcg gtgttccgcg gtcctcctcg 4 920 
gctcgcgtcc aggccagcac gaaggaggct 4 980 
agggggtcca ctcgctccag ggtgtgaaga 504 0 
gtgattggtt tgtaggtgta ggccacgtga 5100 
ggggtggggg cgcgttcgtc ctcactctct 5160 
tggggtgagt actccctctg aaaagcgggc 5220 
aaaaacgagg aggatttgat attcacctgg 5280 
tccatctggt cagaaaagac aatctttttg 5340 
cagcttttgt tccctttagt gagggttaat 5400 
gtttcctgtg tgaaattgtt atccgctcac 54 60 
aaagtgtaaa gcctggggtg cctaatgagt 5520 
actgcccgct ttccagtcgg gaaacctgtc 5580 
cgcggggaga ggcggtttgc gtattgggcg 5 64 0 
gcgctcggtc gttcggctgc ggcgagcggt 5700 
atccacagaa tcaggggata acgcaggaaa 57 60 
caggaaccgt aaaaaggccg cgttgctggc 5820 
gcatcacaaa aatcgacgct caagtcagag 588 0 
ccaggcgttt ccccctggaa gctccctcgt 594 0 
cggatacctg tccgcctttc tcccttcggg 6000 
taggtatctc agttcggtgt aggtcgttcg 6060 
cgttcagccc gaccgctgcg ccttatccgg 6120 
acacgactta tcgccactgg cagcagccac 6180 
aggcggtgct acagagttct tgaagtggtg 624 0 
atttggtatc tgcgctctgc tgaagccagt 6300 
atccggcaaa caaaccaccg ctggtagcgg 6360 
gcgcagaaaa aaaggatctc aagaagatcc 64 20 
gtggaacgaa aactcacgtt aagggatttt 6480 
ctagatcctt ttaaattaaa aatgaagttt 6540 
ttggtctgac agttaccaat gcttaatcag 6 600 
tcgttcatcc atagttgcct gactccccgt 6660 
accatctggc cccagtgctg caatgatacc 6720 
atcagcaata aaccagccag ccggaagggc 67 80 
cgcctccatc cagtctatta attgttgccg 6840 
tagtttgcgc aacgttgttg ccattgctac 6900 
tatggcttca ttcagctccg gttcccaacg 6960 
gtgcaaaaaa gcggttagct ccttcggtcc 7020 
agtgttatca ctcatggtta tggcagcact 7080 
aagatgcttt tctgtgactg gtgagtactc 7140 
gcgaccgagt tgctcttgcc cggcgtcaat 7200 
tttaaaagtg ctcatcattg gaaaacgttc 7260 
gctgttgaga tccagttcga tgtaacccac 7320 
tactttcacc agcgtttctg ggtgagcaaa 7380~ 
aataagggcg acacggaaat gttgaatact 7440 
catttatcag ggttattgtc tcatgagcgg 7500 
acaaataggg gttccgcgca catttccccg 7560 
cgcattaagc gcggcgggtg tggtggttac 7620 
cctagcgccc gctcctttcg ctttcttccc 7680 
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ttcctttctc gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt 77 4 0 

agggttccga tttagtgctt tacggcacct cgaccccaaa aaacttgatt agggtgatgg 7800 

ttcacgtagt gggccatcgc cctgatagac ggtttttcgc cctttgacgt tggagtccac 78 60 

gttctttaat agtggactct tgttccaaac tggaacaaca ctcaacccta tctcggtcta 7920 

ttcttttgat ttataaggga ttttgcgatt tcggcctatt ggttaaaaaa tgagctgatt 7 980 

taacaaaaat ttaacgcgaa ttttaacaaa atattaacgc ttacaatttc cattcgccat 8040 

tcaggctgcg caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc 8100 

tggcgaaagg gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt 8160 

cacgacgttg taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat 8220 

tggagctcca ccgcggtg 8238 



<210> 51 
<211> 11 
<212> DNA 1 
<213> adenovirus 

<400> 51 
cgcggatccc g 



<210> 52 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 52 

ctgacaaact cagatcttgt ttattg 



<210> 53 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 53 

gtcgactcta gaggatccag a ^ x 



<210> 54 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 54 

ccggactcta gatggcaacc atggcgctac 



<210> 55 
<211> 31 
<212> DNA 

<213> Artificial Sequence 



-51- 

<220> 

<223> Description of Artificial Sequence: primer 
<400> 55 

ggaggggaag cttggccctc agccagcctc t 



<210> 56 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 56 

tgtcttggat ccaagatgaa gcgcgcccgc cccagcgaag atgacttc 



<210> 57 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 57 

aaacacggcg gccgctcttt cattcttg 



<210> 58 
<211> 25 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 
<400> 58 

cgcgctgact cttaggacta gtttc 



<210> 59 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 59 

gcgcttaatt aacatcatca ataatatacc ttatttt 



<210> 60 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
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<400> 60 

tgaagcgcgc aagaccgtct gaag 24 



<210> 61 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 61 

cataacactg cagattcttt attcttgg 28 



<210> 62 

<211> 47 

<212> DNA 

<213> Artificial 



Sequence 



<220> ^ 
<223> Description of Artificial Sequence: primer 

<400> 62 

ggtacacagg aaacaggagg ttccggaggt ggaggagaca caactcc 47 



<210> 63 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic peptide 
<400> 63 

Gly Gly Ser Gly Gly Gly 
1 5 



<210> 64 
<211> 7231 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: plasmid 



<400> 64 

ctgctccctg 

acaaggcaag 

ctgcttcgcg 

tagtaatcaa 

cttacggtaa 

atgacgtatg 

tatttacggt 

cctattgacg 

tgggactttc 

cggttttggc 

ctccacccca 

aaatgtcgta 



cttgtgtgtt 
gcttgaccga 
atgtacgggc 
ttacggggtc 
atggcccgcc 
ttcccatagt 
aaactgccca 
tcaatgacgg 
ctacttggca 
agtacatcaa 
ttgacgtcaa 
acaactccgc 



ggaggtcgct 
caattgcatg 
cagatatacg 
attagttcat 
tggctgaccg 
aacgccaata 
cttggcagta 
taaatggccc 
gtacatctac 
tgggcgtgga 
tgggagtttg 
cccattgacg 



gagtagtgcg 
aagaatctgc 
cgttgacatt 
agcccatata 
cccaacgacc 
gggactttcc 
catcaagtgt 
gcctggcatt 
gtattagtca 
tagcggtttg 
ttttggcacc 
caaatgggcg 



cgagcaaaat 
ttagggttag 
gattattgac 
tggagttccg 
cccgcccatt 
attgacgtca 
atcatatgcc 
atgcccagta 
tcgctattac 
actcacgggg 
aaaatcaacg 
gtaggcgtgt 



ttaagctaca 60 
gcgttttgcg 120 
tagttattaa 180 
cgttacataa 240 
gacgtcaata 300 
atgggtggac 360 
aagtacgccc 420 
catgacctta 4 80 
catggtgatg 540 
atttccaagt 600 
ggactttcca 660 
acggtgggag 720 
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gtctatataa gcagagctct ctggctaact agagaaccca ctgcttactg gcttatcgaa 780 

attaatacga ctcactatag ggagacccaa gctggctagc gtttaaactt aagcttggta 840 

ccgagctcgg atccactctc ttccgcatcg ctgtctgcga gggccagctg ttggggtgag 900 

tactccctct gaaaagcggg catgacttct gcgctaagat tgtcagtttc caaaaacgag 960 

gaggatttga tattcacctg gcccgcggtg atgcctttga gggtggccgc atccatctgg 1020 

tcagaaaaga caatcttttt gttgtcaagc ttggtggcaa acgacccgta gagggcgttg 1080 

gacagcaact tggcgatgga gcgcagggtt tggtttttgt cgcgatcggc gcgctccttg 1140 

gccgcgatgt ttagctgcac gtattcgcgc gcaacgcacc gccattcggg aaagacggtg 1200 

gtgcgctcgt cgggcaccag gtgcacgcgc caaccgcggt tgtgcagggt gacaaggtca 12 60 

acgctggtgg ctacctctcc gcgtaggcgc tcgttggtcc agcagaggcg gccgcccttg 1320 

cgcgagcaga atggcggtag ggggtctagc tgcgtctcgt ccggggggtc tgcgtccacg 138 0 

gtaaagaccc cgggcagcag gcgcgcgtcg aagtagtcta tcttgcatcc ttgcaagtct 144 0 

agcgcctgct gccatgcgcg ggcggcaagc gcgcgctcgt atgggttgag tgggggaccc 1500 

catggcatgg ggtgggtgag cgcggaggcg tacatgccgc aaatgtcgta aacgtagagg 15 60 

ggctctctga gtattccaag atatgtaggg tagcatcttc caccgcggat gctggcgcgc 1620 

acgtaatcgt atagttcgtg cgagggagcg aggaggtcgg gaccgaggtt gctacgggcg 1680 

ggctgctctg ctcggaagac tatctgcctg aagatggcat gtgagttgga tgatatggtt 1740 

ggacgctgga agacgttgaa gctggcgtct gtgagaccta ccgcgtcacg cacgaaggag 1800 

gcgtaggagt cgcgcagctt gttgaccagc tcggcggtga cctgcacgtc tagggcgcag 18 60 

tagtccaggg tttccttgat gatgtcatac ttatcctgtc cctttttttt ccacagctcg 1920 

cggttgagga caaactcttc gcggtctttc cagtactctt ggatcggaaa cccgtcggcc 1980 

tccgaacgag atccgtactc cgccgccgag ggacctgagc gagtccgcat ^gaccggatc 204 0 

ggaaaacctc tcgagaaagg cgtctaacca gtcacagtcg caagatccaa gatgaagcgc 2100 

gcccgcccca gcgaagatga cttcaacccc gtctacccct atggctacgc gcggaatcag 2160 

aatatcccct tcctcactcc cccctttgtc tcctccgatg gattcaaaaa cttcccccct 2220 

ggggtactgt cactcaaact ggctgatcca atcaccatta ccaatgggga tgtatccctc 2280 

aaggtgggag gtggtctcac tttgcaagat ggaagcctaa ctgtaaaccc taaggctcca 234 0 

ctgcaagtta atactgataa aaaacttgag cttgcatatg ataatccatt tgaaagtagt 2400 

gctaataaac ttagtttaaa agtaggacat ggattaaaag tattagatga aaaaagtgct 24 60 

gcggggttaa aagatttaat tggcaaactt gtggttttaa caggaaaagg aataggcact 2520 

gaaaatttag aaaatacaga tggtagcagc agaggaattg gtataaatgt aagagcaaga 2580 

gaagggttga catttgacaa tgatggatac ttggtagcat ggaacccaaa gtatgacacg 2640 

cgcacacttt ggacaacacc agacacatct ccaaactgca caattgctca agataaggac 2700 

tctaaactca ctttggtact tacaaagtgt ggaagtcaaa tattagctaa tgtgtctttg 2760 

attgtggtcg caggaaagta ccacatcata aataataaga caaatccaaa aataaaaagt 2820 

tttactatta aactgctatt taataagaac ggagtgcttt tagacaactc aaatcttgga 2880 

aaagcttatt ggaactttag aagtggaaat tccaatgttt cgacagctta tgaaaaagca 2940 

attggtttta tgcctaattt ggtagcgtat ccaaaaccca gtaattctaa aaaatatgca 3000 

agagacatag tttatggaac tatatatctt ggtggaaaac ctgatcagcc agcagtcatt 3060 

aaaactacct ttaaccaaga aactggatgt gaatactcta tcacatttaa ctttagttgg 3120 

tccaaaacct atgaaaatgt tgaatttgaa accacctctt ttaccttctc ctatattgcc 3180 

caagaatgaa agagcggccg ctcgagtcta gagggcccgt ttaaacccgc tgatcagcct 324 0 

cgactgtgcc ttctagttgc cagccatctg ttgtttgccc ctcccccgtg ccttccttga 3300 

ccctggaagg tgccactccc actgtccttt cctaataaaa tgaggaaatt gcatcgcatt 3360 

gtctgagtag gtgtcattct attctggggg gtggggtggg gcaggacagc aagggggagg 34 2 0 

attgggaaga caatagcagg catgctgggg atgcggtggg ctctatggct tctgaggcgg 3480 

aaagaaccag ctggggctct agggggtatc cccacgcgcc ctgtagcggc gcattaagcg 3540 

cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg 3600 

ctcctttcgc tttcttccct tcctttctcg , ccacgttcgc cggctttccc cgtcaagctc 3660 

taaatcgggg catcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa 3720 

aacttgatta gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc 3780 

ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac 3840 

tcaaccctat ctcggtctat tcttttgatt tataagggat tttggggatt tcggcctatt 3900 

ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttaattctgt ggaatgtgtg 3960 

tcagttaggg tgtggaaagt ccccaggctc cccaggcagg cagaagtatg caaagcatgc 4 020 

atctcaatta gtcagcaacc aggtgtggaa agtccccagg ctccccagca ggcagaagta 4 080_ 

tgcaaagcat gcatctcaat tagtcagcaa ccatagtccc gcccctaact ccgcccatcc 4140 

cgcccctaac tccgcccagt tccgcccatt ctccgcccca tggctgacta atttttttta 4200 

tttatgcaga ggccgaggcc gcctctgcct ctgagctatt ccagaagtag tgaggaggct 4260 

tttttggagg cctaggcttt tgcaaaaagc tcccgggagc ttgtatatcc attttcggat 4 320 

ctgatcagca cgtgttgaca attaatcatc ggcatagtat atcggcatag tataatacga 4 380 

caaggtgagg aactaaacca tggccaagtt gaccagtgcc gttccggtgc tcaccgcgcg 4 4 40 
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cgacgtcgcc 
ggaggacgac 
ggaccaggtg 
gtacgccgag 
gaccgagatc 
ctgcgtgcac 
cgccgccttc 
cctccagcgc 
ttataatggt 
actgcattct 
gtcgacctct 
ttatccgctc 
tgcctaatga 
gggaaacctg 
gcgtattggg 
gcggcgagcg 
taacgcagga 
cgcgttgctg 
ctcaagtcag 
aagctccctc 
tctcccttcg 
gtaggtcgtt 
cgccttatcc 
ggcagcagcc 
cttgaagtgg 
gctgaagcca 
cgctggtagc 
tcaagaagat 
ttaagggatt 
aaaatgaagt 
atgcttaatc 
ctgactcccc 
tgcaatgata 
agccggaagg 
taattgttgc 
tgccattgct 
cggttcccaa 
ctccttcggt 
tatggcagca 
tggtgagtac 
cccggcgtca 
tggaaaacgt 
gatgtaaccc 
tgggtgagca 
atgttgaata 
tctcatgagc 
cacatttccc 



ggagcggtcg 
ttcgccggtg 
gtgccggaca 
tggtcggagg 
ggcgagcagc 
ttcgtggccg 
tatgaaaggt 
ggggatctca 
tacaaataaa 
agttgtggtt 
agctagagct 
acaattccac 
gtgagctaac 
tcgtgccagc 
cgctcttccg 
gtatcagctc 
aagaacatgt 
gcgtttttcc 
aggtggcgaa 
gtgcgctctc 
ggaagcgtgg 
cgctccaagc 
ggtaactatc 
actggtaaca 
tggcctaact 
gttaccttcg 
ggtggttttt 
cctttgatct 
ttggtcatga 
tttaaatcaa 
agtgaggcac 
gtcgtgtaga 
ccgcgagacc 
gccgagcgca 
cgggaagcta 
acaggcatcg 
cgatcaaggc 
cctccgatcg 
ctgcataatt 
tcaaccaagt 
atacgggata 
tcttcggggc 
actcgtgcac 
aaaacaggaa 
ctcatactct 
ggatacatat 
cgaaaagtgc 



agttctggac 
tggtccggga 
acaccctggc 
tcgtgtccac 
cgtgggggcg 
aggagcagga 
tgggcttcgg 
tgctggagtt 
gcaatagcat 
tgtccaaact 
tggcgtaatc 
acaacatacg 
tcacattaat 
tgcattaatg 
cttcctcgct 
actcaaaggc 
gagcaaaagg 
ataggctccg 
acccgacagg 
ctgttccgac 
cgctttctca 
tgggctgtgt 
gtcttgagtc 
ggattagcag 
acggctacac 
gaaaaagagt 
ttgtttgcaa 
tttctacggg 
gattatcaaa 
tctaaagtat 
ctatctcagc 
taactacgat 
cacgctcacc 
gaagtggtcc 
gagtaagtag 
tggtgtcacg 
gagt tacatg 
ttgtcagaag 
ctcttactgt 
cattctgaga 
ataccgcgcc 
gaaaactctc 
ccaactgatc 
ggcaaaatgc 
tcctttttca 
ttgaatgtat 
cacctgacgt 



cgaccggctc 
cgacgtgacc 
ctgggtgtgg 
gaacttccgg 
ggagttcgcc 
ctgacacgtg 
aatcgttttc 
cttcgcccac 
cacaaatttc 
catcaatgta 
atggtcatag 
agccggaagc 
tgcgttgcgc 
aatcggccaa 
cactgactcg 
ggtaatacgg 
ccagcaaaag 
cccccctgac 
actataaaga 
cctgccgctt 
atgctcacgc 
gcacgaaccc 
caacccggta 
agcgaggtat 
tagaaggaca 
tggtagctct 
gcagcagatt 
gtctgacgct 
aaggatcttc 
atatgagtaa 
gatctgtcta 
acgggagggc 
ggct ccagat 
tgcaacttta 
ttcgccagtt 
ctcgtcgttt 
atcccccatg 
taagttggcc 
catgccatcc 
atagtgtatg 
acatagcaga 
aaggatctta 
ttcagcatct 
cgcaaaaaag 
atattattga 
ttagaaaaat 
c 



gggttctccc 
ctgttcatca 
gtgcgcggcc 
gacgcctccg 
ctgcgcgacc 
ctacgagatt 
cgggacgccg 
cccaacttgt 
acaaataaag 
tcttatcatg 
ctgtttcctg 
ataaagtgta 
tcactgcccg 
cgcgcgggga 
ctgcgctcgg 
ttatccacag 
gccaggaacc 
gagcatcaca 
taccaggcgt 
accggatacc 
tgtaggtatc 
cccgttcagc 
agacacgact 
gtaggcggtg 
gtatttggta 
tgatccggca 
acgcgcagaa 
cagtggaacg 
acctagatcc 
acttggtctg 
tttcgttcat 
ttaccatctg 
ttatcagcaa 
tccgcctcca 
aatagtttgc 
ggtatggctt 
ttgtgcaaaa 
gcagtgttat 
gtaagatgct 
cggcgaccga 
actttaaaag 
ccgctgttga 
tttactttca 
ggaataaggg 
agcatttatc 
aaacaaatag 



gggacttcgt 

gcgcggtcca 

tggacgagct 

ggccggccat 

cggccggcaa 

tcgattccac 

gctggatgat 

ttattgcagc 

catttttttc 

tctgtatacc 

tgtgaaattg 

aagcctgggg 

ctttccagtc 

gaggcggttt 

tcgttcggct 

aatcagggga 

gtaaaaaggc 

aaaatcgacg 

ttccccctgg 

tgtccgcctt 

tcagttcggt 

<?cgaccgctg 

tatcgccact 

ctacagagtt 

tctgcgctct 

aacaaaccac 

aaaaaggatc 

aaaactcacg 

ttttaaatta 

acagttacca 

ccatagttgc 

gccccagtgc 

taaaccagcc 

tccagtctat 

gcaacgttgt 

cattcagctc 

aagcggttag 

cactcatggt 

tttctgtgac 

gttgctcttg 

tgctcatcat 

gatccagttc 

ccagcgtttc 

cgacacggaa 

agggttattg 

gggttccgcg 



4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6S40 

6900 

6960 

7020 

7080 

7140 

7200 

7231 



<210> 65 
<211> 8484 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 65 

ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg cgagcaaaat ttaagctaca 60 
acaaggcaag gcttgaccga caattgcatg aagaatctgc ttagggttag gcgttttgcg 120 
ctgcttcgcg atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa 180 
tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa 240 



-55- 



cttacggtaa 
atgacgtatg 
tatttacggt 
cctattgacg 
tgggactttc 
cggttttggc 
ctccacccca 
aaatgtcgta 
gtctatataa 
attaatacga 
ccgagctcgg 
tactccctct 
gaggatttga 
tcagaaaaga 
gacagcaact 
gccgcgatgt 
gtgcgctcgt 
acgctggtgg 
cgcgagcaga 
gtaaagaccc 
agcgcctgct 
catggcatgg 
ggctctctga 
acgtaatcgt 
ggctgctctg 
ggacgctgga 
gcgtaggagt 
tagtccaggg 
cggttgagga 
tccgaacgag 
ggaaaacctc 
gcaagaccgt 
ccaactgtgc 
cctggggtac 
ctcaaaatgg 
accactgtga 
cccctcacag 
ggcaacacac 
attgccaccc 
cccctcacca 
gccactggta 
ggactaaagt 
actggtccag 
ggttttgatt 
caaaacagac 
ctaagactag 
aacaaaggcc 
agcactgcca 
cttgaatttg 
ggcctagaat 
gacagcacag 
acaccagctc 
gtcttaacaa 
agtttggctc 
aatggagtgc 
gatcttactg 
tatccaaaat 
ggagacaaaa 
gacacaactc 
attaatgaaa 
aagcggccgc 
actggtattc 



atggcccgcc 
ttcccatagt 
aaactgccca 
tcaatgacgg 
ctacttggca 
agtacatcaa 
ttgacgtcaa 
acaactccgc 
gcagagctct 
ctcactatag 
atccactctc 
gaaaagcggg 
tattcacctg 
caatcttttt 
tggcgatgga 
ttagctgcac 
cgggcaccag 
ctacctctcc 
atggcggtag 
cgggcagcag 
gccatgcgcg 
ggtgggtgag 
gtattccaag 
atagttcgtg 
ctcggaagac 
agacgttgaa 
cgcgcagctt 
tttccttgat 
caaactcttc 
atccgtactc 
tcgagaaagg 
ctgaagatac 
cttttcttac 
tctctttgcg 
gcaacggcct 
gcccacctct 
ttacctcaga 
tcaccatgca 
aaggacccct 
ccaccgatag 
gcttgggcat 
acggggctcc 
gtgtgactat 
cacaaggcaa 
gccttatact 
gacagggccc 
tttacttgtt 
aggggttgat 
gttcacctaa 
ttgattcaaa 
gtgccattac 
catctcctaa 
aatgtggcag 
caatatctgg 
tactaaacaa 
aaggcacagc 
ctcacggtaa 
ctaaacctgt 
caagtgcata 
tatttgccac 
tcgagtctag 
ttaactatgt 



tggctgaccg 
aacgccaata 
cttggcagta 
taaatggccc 
gtacatctac 
tgggcgtgga 
tgggagtttg 
cccattgacg 
ctggctaact 
ggagacccaa 
ttccgcatcg 
catgacttct 
gcccgcggtg 
gttgtcaagc 
gcgcagggtt 
gtattcgcgc 
gtgcacgcgc 
gcgtaggcgc 
ggggtctagc 
gcgcgcgtcg 
ggcggcaagc 
cgcggaggcg 
atatgtaggg 
cgagggagcg 
tatctgcctg 
gctggcgtct 
gttgaccagc 
gatgtcatac 
gcggtctttc 
cgccgccgag 
cgtctaacca 
cttcaacccc 
tcctcccttt 
cctatccgaa 
ctctctggac 
caaaaaaacc 
agccctaact 
atcacaggcc 
cacagtgtca 
cagtaccctt 
tgacttgaaa 
tttgcatgta 
taataatact 
tatgcaactt 
tgatgttagt 
tctttttata 
tacagcttca 
gtttgacgct 
tgcaccaaac 
caaggctatg 
agtaggaaac 
ctgtagacta 
tcaaatactt 
aacagttcaa 
ttccttcctg 
ctatacaaac 
aactgccaaa 
aacactaacc 
ctctatgtca 
atcctcttac 
cgataatcaa 
tgctcctttt 



cccaacgacc 
gggactttcc 
catcaagtgt 
gcctggcatt 
gtattagtca 
tagcggtttg 
ttttggcacc 
caaatgggcg 
agagaaccca 
gctggctagc 
ctgtctgcga 
gcgctaagat 
atgcctttga 
ttggtggcaa 
tggtttttgt 
gcaacgcacc 
caaccgcggt 
tcgttggtcc 
tgcgtctcgt 
aagtagtcta 
gcgcgctcgt 
tacatgccgc 
tagcatcttc 
aggaggtcgg 
aagatggcat 
gtgagaccta 
tcggcggtga 
ttatcctgtc 
cagtactctt 
ggacctgagc 
gtcacagtcg 
gtgtatccat 
gtatccccca 
cctctagtta 
gaggccggca 
aagtcaaaca 
gtggctgccg 
ccgctaaccg 
gaaggaaagc 
actatcactg 
gagcccattt 
acagacgacc 
tccttgcaaa 
aatgtagcag 
tatccgtttg 
aactcagccc 
aacaattcca 
acagccatag 
acaaatcccc 
gttcctaaac 
aaaaataatg 
aatgcagaga 
gctacagttt 
agtgctcatc 
gacccagaat 
gctgttggat 
agtaacattg 
attacactaa 
ttttcatggg 
actttttcat 
cctctggatt 
acgctatgtg 



cccgcccatt 

attgacgtca 

atcatatgcc 

atgcccagta 

tcgctattac 

actcacgggg 

aaaatcaacg 

gtaggcgtgt 

ctgcttactg 

gtttaaactt 

gggccagctg 

tgtcagtttc 

gggtggccgc 

acgacccgta 

cgcgatcggc 

gccattcggg 

tgtgcagggt 

agcagaggcg 

ccggggggtc 

tcttgcatcc 

atgggttgag 

aaatgtcgta 

caccgcggat 

gaccgaggtt 

gtgagttgga 

ccgcgtcacg 

cctgcacgtc 

cctttttttt 

ggatcggaaa 

gagtccgcat 

caagatccaa 

atgacacgga 

atgggtttca 

cctccaatgg 

accttacctc 

taaacctgga 

ccgcacctct 

tgcacgactc 

tagccctgca 

cctcaccccc 

atacacaaaa 

taaacacttt 

ctaaagttac 

gaggactaag 

atgctcaaaa 

acaacttgga 

aaaagcttga 

ccattaatgc 

tcaaaacaaa 

taggaactgg 

ataagctaac 

aagatgctaa 

cagttttggc 

ttattataag 

attggaactt 

ttatgcctaa 

tcagtcaagt 

acggtacaca 

actggtctgg 

acattgccca 

acaaaatttg 

gatacgctgc 



gacgtcaata 

atgggtggac 

aagtacgccc 

catgacctta 

catggtgatg 

atttccaagt 

ggactttcca 

acggtgggag 

gcttatcgaa 

aagcttggta 

ttggggtgag 

caaaaacgag 

atccatctgg 

gagggcgttg 

gcgctccttg 

aaagacggtg 

gacaaggtca 

gccgcccttg 

tgcgtccacg 

ttgcaagtct 

tgggggaccc 

aacgtagagg 

gctggcgcgc 

gctacgggcg 

tgatatggtt 

cacgaaggag 

tagggcgcag 

ccacagctcg 

cccgtcggcc 

cgaccggatc 

gatgaagcgc 

aaccggtcct 

agagagtccc 

catgcttgcg 

ccaaaatgta 

aatatctgca 

aatggtcgcg 

caaacttagc 

aacatcaggc 

tctaactact 

tggaaaacta 

gaccgtagca 

tggagccttg 

gattgattct 

ccaactaaat 

tattaactac 

ggttaaccta 

aggagatggg 

aattggccat 

ccttagtttt 

tttgtggacc 

actcactttg 

tgttaaaggc 

atttgacgaa 

tagaaatgga 

cctatcagct 

ttacttaaac 

ggaaacagga 

ccacaactac 

agaataaaag 

tgaaagattg 

tttaatgcct 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660" 

3720 

3780 

3840 

3900 

3960 
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ttgtatcatg ctattgcttc ccgtatggct ttcattttct cctccttgta taaatcctgg 4020 
ttgctgtctc tttatgagga gttgtggccc gttgtcaggc aacgtggcgt ggtgtgcact 4080 
gtgtttgctg acgcaacccc cactggttgg ggcattgcca ccacctgtca gctcctttcc 4140 
gggactttcg ctttccccct ccctattgcc acggcggaac tcatcgccgc ctgccttgcc 4200 
cgctgctgga caggggctcg gctgttgggc actgacaatt ccgtggtgtt gtcggggaag 4260 
ctgacgtcct ttccatggct gctcgcctgt gttgccacct ggattctgcg cgggacgtcc 4320 
ttctgctacg tcccttcggc cctcaatcca gcggaccttc cttcccgcgg cctgctgccg 4380 
gctctgcggc ctcttccgcg tcttcgcctt cgccctcaga cgagtcggat ctccctttgg 4440 
gccgcctccc cgcctgatcg ctagagggcc cgtttaaacc cgctgatcag cctcgactgt 4500 
gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga 4560 
aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag 4 620 
taggtgtcat tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga 4 680 
agacaatagc aggcatgctg gggatgcggt gggctctatg gcttctgagg cggaaagaac 4740 
cagctggggc tctagggggt atccccacgc gccctgtagc ggcgcattaa gcgcggcggg 4800 
tgtggtggtt acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt 4860 
cgctttcttc ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg 4 920 
gggcatccct ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga 4 980 
ttagggtgat ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac 5040 
gttggagtcc acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc 5100 
tatctcggtc tattcttttg atttataagg gattttgggg atttcggcct attggttaaa 5160 
aaatgagctg atttaacaaa aatttaacgc gaattaattc tgtggaatgt gtgtcagtta 5220 
gggtgtggaa agtccccagg ctccccaggc aggcagaagt atgcaaagca tgcatctcaa 5280 
ttagtcagca accaggtgtg gaaagtcccc aggctcccca gcaggcagaa gtatgcaaag 5340 
catgcatctc aattagtcag caaccatagt cccgccccta actccgccca tcccgcccct 54 00 
aactccgccc agttccgccc attctccgcc ccatggctga ctaatttttt ttatttatgc 5460 
agaggccgag gccgcctctg cctctgagct attccagaag tagtgaggag gcttttttgg 5520 
aggcctaggc ttttgcaaaa agctcccggg agcttgtata tccattttcg gatctgatca 5580 
gcacgtgttg acaattaatc atcggcatag tatatcggca tagtataata cgacaaggtg 5640 
aggaactaaa ccatggccaa gttgaccagt gccgttccgg tgctcaccgc gcgcgacgtc 5700 
gccggagcgg tcgagttctg gaccgaccgg ctcgggttct cccgggactt cgtggaggac 5760 
gacttcgccg gtgtggtccg ggacgacgtg accctgttca tcagcgcggt ccaggaccag 5820 
gtggtgccgg acaacaccct ggcctgggtg tgggtgcgcg gcctggacga gctgtacgcc 5880 
gagtggtcgg aggtcgtgtc cacgaacttc cgggacgcct ccgggccggc catgaccgag 5940 
atcggcgagc agccgtgggg gcgggagttc gccctgcgcg acccggccgg caactgcgtg 6000 
cacttcgtgg ccgaggagca ggactgacac gtgctacgag atttcgattc caccgccgcc 6060 
ttctatgaaa ggttgggctt cggaatcgtt ttccgggacg ccggctggat gatcctccag 6120 
cgcggggatc tcatgctgga gttcttcgcc caccccaact tgtttattgc agcttataat 6180 
ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt ttcactgcat 6240 
tctagttgtg gtttgtccaa actcatcaat gtatcttatc atgtctgtat accgtcgacc 6300 
tctagctaga gcttggcgta atcatggtca tagctgtttc ctgtgtgaaa ttgttatccg 6360 
ctcacaattc cacacaacat acgagccgga agcataaagt gtaaagcctg gggtgcctaa 6420 
tgagtgagct aactcacatt aattgcgttg cgctcactgc ccgctttcca gtcgggaaac 64 80 
ctgtcgtgcc agctgcatta atgaatcggc caacgcgcgg ggagaggcgg tttgcgtatt 6540 
gggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga 6600 
gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca 6660 
ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg 6720 
ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt 6780 
cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc 6840 
ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct 6900 
tcgggaagcg tggcgctttc tcaatgctca cgctgtaggt atctcagttc ggtgtaggtc 6960 
gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta 7020 
tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca 7080 
gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcttgaag 7140 
tggtggccta actacggcta cactagaagg acagtatttg gtatctgcgc tctgctgaag 7200 
ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac caccgctggt 7260 
agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa 7320_ 
gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg 7380 
attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga 7440 
agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta ccaatgctta 7500 
atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt tgcctgactc 7560 
cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag tgctgcaatg 7 620 
ataccgcgag acccacgctc accggctcca gatttatcag caataaacca gccagccgga 7680 
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agggccgagc 
tgccgggaag 
gctacaggca 
caacgatcaa 
ggtcctccga 
gcactgcata 
tactcaacca 
tcaatacggg 
cgttcttcgg 
cccactcgtg 
gcaaaaacag 
at actcatac 
agcggataca 
ccccgaaaag 



gcagaagtgg 
ctagagtaag 
tcgtggtgtc 
ggcgagttac 
tcgttgtcag 
attctcttac 
agtcattctg 
ataataccgc 
ggcgaaaact 
cacccaactg 
gaaggcaaaa 
tcttcctttt 
tatttgaatg 
tgccacctga 



tcctgcaact 
tagttcgcca 
acgctcgtcg 
atgatccccc 
aagtaagttg 
tgtcatgcca 
agaatagtgt 
gccacatagc 
ctcaaggatc 
atcttcagca 
tgccgcaaaa 
tcaatattat 
tatttagaaa 
cgtc 



ttatccgcct 
gttaatagtt 
tttggtatgg 
atgttgtgca 
gccgcagtgt 
tccgtaagat 
atgcggcgac 
agaactttaa 
ttaccgctgt 
tcttttactt 
aagggaataa 
tgaagcattt 
aataaacaaa 



ccatccagtc 
tgcgcaacgt 
cttcattcag 
aaaaagcggt 
tatcactcat 
gcttttctgt 
cgagttgctc 
aagtgctcat 
tgagatccag 
tcaccagcgt 
gggcgacacg 
atcagggtta 
taggggttcc 



tattaattgt 
tgttgccatt 
ctccggttcc 
tagctccttc 
ggttatggca 
gactggtgag 
ttgcccggcg 
cattggaaaa 
ttcgatgtaa 
ttctgggtga 
gaaatgttga 
ttgtctcatg 
gcgcacattt 



7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8484 



<210> 66 
<211> 53 
<212> DNA 

<213> Artificial Sequence 



<220> % 
<223> Description of Artificial Sequence: primer 

<400> 66 

gtcactcgag gactcggtcg actgaaaatg agacatatta tctgccacgg acc 



<210> 67 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 67 

cgagatcgat cacctccggt acaaggtttg gcatag 36 



<210> 68 
<211> 37 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 68 

catgaagatc tggaaggtgc tgaggtacga tgagacc 



<210> 69 

<211> 51 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: primer 



<400> 69 

gcgacttaag cagtcagctg agacagcaag acacttgctt gatccaaatc c 



51 
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<210> 70 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 70 

cacgaattcg tcagcgcttc tcgtcgcgtc caagaccc 



<210> 71 , 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 71 

caccccgggg aggcggcggc gacggggacg gg 



<210> 72 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 

<400> 72 

atgggatcca agatgaagcg cgcaagaccg 



<210> 73 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
<400> 73 

cataacctgc aggattcttt attcttgggc 



<210> 74 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
<400> 74 

ggtacacagg aaacaggagg ttccggaggt ggaggagaca caactcc 



32 



<210> 75 



-59- 

<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 75 

atgggatcca agatgaagcg cgcaagaccg 30 



<210> 76 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 76 

cactatagcg gccgcattct cagtcatctt 



30 



